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Princeton  VLSI  Project:  Semi-Annual  Report 

R.  J.  Upton 


1.  Introduction 

We  have  been  officially  underway  at  Princeton  since  August;  hence,  this 
report  actually  covers  about  three  months  of  activity.  There  are  three  major 
aspects  to  our  project:  AL1,  Census,  and  Testing. 

2.  All 

ALI.  our  procedural  language  for  VLSI  design  and  layout,  is  now  up  and  run* 
ning  [4].  It  has  already  been  used  by  Dobkin  and  Drysdale  to  redesign  a  divider 
they  previously  designed  on  a  graphics  system  at  Xerox.  In  addition.  LaPaugh's 
VLSI  class  has  already  used  ALI  for  their  VLSI  projects.  Dobkin  is  currently 
beginning  to  explore  ways  to  create  graphics  interfaces  to  the  ALI  system. 

ALI2  the  second  version  of  our  layout  system  is  coming  along  well.  Valdes 
has  already  fully  defined  the  new  language  and  implementation  is  now  under  way 
[7]  ALI2  differs  from  ALJ  in  two  essential  ways.  First,  it  is  based  on.  what  we 
believe  to  be  a  much  cleaner  set  of  primitive  and  constructs.  For  instance,  it 
does  not  have  the  "shuffle  property”  so  many  design  languages  do  by  this  we 
mean  that  the  order  of  placement  commands  does  matter.  Second,  AL12  gen¬ 
erates  tar  fewer  constraints  than  ALI  does.  This  is  of  course  critical  if  such  con¬ 
straint  based  systems  are  to  be  able  to  handle  large  complex  layouts. 

Plans  are  already  underway  on  how  to  best  exploit  the  unique  features  of 
AL12.  Ramachandran  has  just  finished  a  study  o'  the  cost  of  increasing  the  size 
of  drivers  in  order  to  speed  up  circuits  [5].  She  assumes  that  sizes  of  transis¬ 
tors  can  be  changed  but  that  the  layout  cannot  be  restructured.  Under  these 
assumptions  she  can  tightly  bound  the  worst  case  cost  in  terms  of  area  of  mak¬ 
ing  the  delays  on  all  wires  a  "constant".  Since  ALJ2  allows  a  designer  to  easily 
change  the  sizes  of  wires  and  transistors  we  feel  that  such  results  are  important 
Already  it  is  common  place  for  designers  to  size  transistors  for  speedup  in  an  ad 
Aoc  way  [6]. 

Also  Vijayan  and  Wigderson  have  isolated  a  number  of  new  layout  problems 
that  arise  naturally  when  one  considers  the  implementation  of  ALJ2  [6,9].  All 
these  problems  concern  the  embedding  of  "rectilinear”  graphs.  These  are 
graphs  where  each  edge  is  connected  to  the  "left”,  "top”,  "right", or  "bottom"  of 
each  vertex  They  are  currently  exploring  the  computational  complexity  of  a 
variety  of  layout  problems  here.  For  example,  they  can  quickly  recognize  those 
graphs  that  have  planar  embeddings,  moreover,  they  can  also  quickly  find  such 
embeddings  when  they  exist. 

3.  Census 

The  census  language  is  a  new  way  to  express  parallel  algorithms  that  use  a 
fairly  loosely  coupled  method  of  control  [3]  Work  is  under  way  to  understand 
the  limits  and  powers  of  such  languages  North  is  beginning  to  identify  those 
problem  areas  that  can  be  successfully  mapped  onto  the  census  language.  We 
are  also  thinking  about  implementations  of  census,  but  no  implementation  is  yet 
started 

Census  has  been  looked  at  from  a  theory  point  of  view  by  Chandra.  Fortune. 


and  Lipton  [2].  They  have  been  able  to  get  tight  upper  and  lower  bounds  on  the 
size  of  boolean  circuits  with  unbounded  fan-in.  Unbounded  fan-in  circuits  not 
only  model  an  important  class  of  census  computations  but  they  also  model  cir¬ 
cuits  such  as  PLA's.  For  example,  it  is  possible  to  construct  a  circuit  that  adds 
two  n-bit  numbers  in  constant  time  and  whose  size  is  approximately  linear  in  n. 
We  are  beginning  to  examine  the  feasibility  of  using  these  ideas  in  real  VLSI 
designs. 

4.  Testing 

Our  work  on  testing  divides  into  two  areas.  Arden  is  beginning  to  work  on 
first  silicon  testing  especially  with  respect  to  scanning  electron  microscopes 
(SEM)  He  plans  to  be  on  leave  this  spring  and  work  with  the  SEM  group  in  Mun¬ 
ich.  In  the  future  we  expect  that  we  will  be  able  to  use  a  SEM  that  the  Princeton 
Siemens  group  is  about  to  get. 

LaPaugh  and  lipton  have  begun  to  work  in  the  area  of  production  testing. 
They  have  already  successfully  been  able  to  completely  characterize  the  testa¬ 
bility  of  "prefix  computations".  Prefix  computations  arise  naturally  in  a  number 
of  places:  for  instance,  in  carry-look-ahead  adders  [l].  Such  characterizations 
link  the  self-test  of  these  computations  with  classic  semi-group  theory. 

Also  work  is  under  way  on  a  new  self-test  strategy  which  we  call  "toggle 
search".  This  method  first  generates  the  vector  of  ail  O  s;  next  it  randomly 
changes  one  of  the  0’s  to  a  1;  it  repeats  this  until  all  0*s  have  been  toggled  to  1's, 
then  the  entire  process  begins  again.  Toggle  search  and  its  generalizations  are 
well  suited  to  many  test  environments  where  one  bit  of  an  input  can  be  changed 
more  quickly  than  a  whole  input  vector.  Already  we  have  empirical  evidence  of 
the  superiority  of  toggle  search  over  other  methods.  Colleagues  at  IBM  Watson 
Research  have  tested  toggle  on  examples  of  about  one  thousand  gates  and  found 
that  it  is  about  three  times  faster  than  standard  methods.  For  example,  it  took 
about  600  thousand  vectors  versus  1.8  million  to  achieve  100%  fault  coverage  on 
one  piece  of  random  control  logic  We  plan  further  experiments  to  further  vali¬ 
date  these  results 

Finally,  we  have  also  found  a  way  to  transform  any  combinational  logic  cir¬ 
cuit  into  one  that  is  easy  to  test.  Here  by  easy  to  test  we  mean  that  we  can 
detect  a  very  large  class  of  physical  faults.  The  penalty  for  this  transformation 
is  that  the  number  of  gates  can  increase  by  as  much  as  100%.  We  plan  this  com¬ 
ing  year  to  carefully  explore  this  new  method.  In  particular,  we  wish  to  both 
understand  the  cost  of  the  method  and  the  class  of  faults  it  can  and  cannot 
detect. 
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Abstract:  The  first  component  of  a  VLSI  design  environment  being  built  at  Princeton  is 
described.  The  general  theme  of  this  effort  is  to  make  the  design  of  VLSI  circuits  as  simi¬ 
lar  to  programming  as  possible.  We  are  trying  to  build  tools  that  do  for  the  VLSI  circuit 
designer  what  the  best  software  tools  do  for  the  implementor  of  large  software  systems. 
The  work  described  here  is  a  procedural  language  to  specify  circuit  layouts. 


1.  Introduction 

In  this  paper  we  describe  a  very  important  component  of  any  VLSI 
design  environment:  a  tool  to  automate  the  layout  of  circuits.  This  work 
is  part  of  an  effort  to  create  an  integrated  environment  for  VLSI  design 
(including  layout  systems,  device  and  switch  level  simulators  and  testing 
facilities)  currently  under  way  at  Princeton. 

Our  main  thesis  is  that  the  VLSI  design  task  can  be  profitably  thought 
of  as  a  programming  task,  as  opposed  to  a  geometric  editing  task.  We 
believe  that  much  is  to  be  gained  by  consciously  attempting  to  apply  our 
knowledge  about  programming  to  this  new  activity.  We  have  thus  tried  to 
create  tools  for  the  VLSI  designer  that  incorporate  the  most  useful 
features  of  the  software  develpoment  tools  that  we  are  familiar  with. 

Although  we  feel  that  we  have  had  moderate  success  in  this  endeavor 
we  are  well  aware  of  how  much  room  for  improvement  we  have  left,  and 
would  like  to  help  convince  the  community  of  people  interested  in  the 
design  of  programming  language  and  programming  environments  that 
there  are  fresh  and  important  challenges  in  this  relatively  new  direction. 

A  prototype  of  the  procedural  layout  language  described  in  this 
paper  has  been  operational  for  some  months.  All  figures  given  in  thic 
paper  were  generated  by  the  language  and  all  the  code  fragments  have 
been  used  as  part  of  larger  programs. 

Portion  of  thio  peper  tpp owed  tr  Uw  Procodlno  of  Un  IMS  ACM  fympartvim  on  Principal  of  Propon 
wbnUmuot—  oadtatlw Prooo «dln|» of  U»  IMS  Doi(n  Automatic Oonformeo 


2.  A U:  a  procedural  language  to  describe  layouts 

The  main  feature  of  ALI  as  a  layout  language  is  that  it  allows  its  user 
to  design  layouts  at  a  conceptual  level  in  which  neither  sizes  nor  posi¬ 
tions  (absolute  or  relative)  of  layout  components  may  be  specified. 
Mostly  as  a  consequence  of  this,  ALI  simultaneously  (i)  makes  the  layout 
task  more  like  programming  than  editing,  (ii)  eliminates  the  need  for 
design  rule  checking  after  the  layout  is  generated,  (iii)  permits  the  crea¬ 
tion  of  easy  to  use  cell  libraries  and  (iv)  provides  the  designer  with  the 
mechanisms  to  describe  a  layout  hierarchically  so  that  most  of  the  detail 
at  one  level  of  the  hierarchy  is  truly  hidden  from  all  higher  levels. 

The  notion  of  not  assigning  sizes  or  positions  to  any  object  in  a  layout 
until  the  complete  layout  has  been  described  (similar  to  the  idea  of 
delayed  binding  in  programming  languages),  sets  ALI  apart  not  only  from 
just  about  all  of  the  graphics  based  layout  editors  we  know  of  ([3],  [4], 
[7],  [13],  [17])  but  also  -  with  the  exception  of  [14]  —  from  most  of  the 
procedural  languages  for  the  layout  task  currently  in  use  or  recently  pro¬ 
posed,  whether  or  not  they  include  a  graphics  interface  ([1],  [4],  [5],  [8], 
[9],  [10],  [15]). 

The  issues  that  we  tried  to  address  with  All  are  the  following. 

•  The  creation  of  an  open  ended  tool.  Graphics  editors  tend  to  be  closed 
tools  in  that  it  is  hard  to  automate  the  layout  process  beyond  what  the 
original  design  of  the  system  allowed.  Procedural  languages  are  gen¬ 
erally  much  better  in  this  respect.  However,  the  fact  that  most  such 
languages  require  the  specification  of  absolute  sizes  and  positions, 
makes  the  creation  of  a  general  purpose  library  of  cells  a  hard  task, 
since  information  about  the  sizes  and  positions  of  the  cell  elements  that 
can  interact  with  the  outside  world  has  to  be  apparent  to  the  user  of 
the  library.  The  absence  of  absolute  sizes  and  positions  makes  this 
problem  much  less  severe  in  ALI  The  extensibility  of  ALI  derives  from 
the  fact  that  it  has  been  built  on  top  of  Pascal,  thereby  making  the  full 
power  of  Pascal  available  to  the  designer.  The  generation  of  tools  to 
automate  the  layout  process,  such  as  simple  routers  or  PLA  generators, 
involves  writing  Pascal  routines  to  solve  some  abstract  version  of  the 
problem  and  having  done  so  invoke  ALJ  cells  to  generate  the  layouts 

•  Creating  tools  that  are  simple  to  use  and  easy  to  learn.  In  particular, 
we  want  to  avoid  tools  whose  behavior  is  unpredictable.  Many  programs 
which  rely  heavily  on  sophisticated  heuristics  respond  to  small  changes 
in  their  input  with  wholesale  changes  in  their  output.  We  have  main¬ 
tained  a  simple  correspondence  between  the  text  of  an  ALI  program  and 
the  resulting  layout  so  that  changes  in  the  layout  can  be  easily  related 
to  changes  in  the  program.  This  decision  has  simplified  the  system  at 
the  cost  of  making  it  less  knowledgeable  about  MOS  circuits 

•  Facilitating  the  division  of  labor.  Large  layouts  have  to  be  produced  by 
more  than  one  designer.  If  the  piece  produced  by  each  designer  is 
specified  in  absolute  positions,  serious  problems  are  likely  to  arise  when 
the  different  pieces  are  put  together,  unless  very  tight  interaction  - 
with  its  attendant  penalties  in  productivity  —  is  maintained  throughout 
the  design.  ALI  allows  the  partitioning  of  tasks  in  such  a  way  that  the 
designer  of  a  piece  of  the  layout  does  not  need  to  know  anything  about 


the  sizes  of  other  pieces  of  the  complete  layout.  For  instance,  on  the 
top  of  fig.  1  three  simple  cells  are  shown  with  the  intended  connections 
between  them  shown  by  dotted  lines;  on  the  bottom  of  the  figure,  the 
pieces  have  been  brought  together  to  form  a  larger  piece.  The  stretch¬ 
ing  that  has  taken  place  has  occurred  without  the  designer  having  to 
plan  for  it  explicitly  while  considering  each  individual  cell. 
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Three  separate  cells  and  the  result  of 
connecting  them  along  the  dotted  lines 


Facilitating  hierarchical  design.  Even  when  a  single  designer  is 
involved,  the  ability  to  view  a  layout  as  a  hierarchy,  with  as  much  infor¬ 
mation  about  lower  levels  completely  hidden  from  higher  levels,  is 
extremely  useful.  In  ALI,  the  information  about  a  given  level  of  the 
hierarchy  needed  at  the  level  immediately  above  is  reduced  by  the 
absence  of  absolute  sizes  and  positions,  to  topological  relations  among 
the  layout  elements  of  the  lower  level  visible  to  the  higher  one. 


•  Reducing  the  life  cycle  cost  of  layouts.  Modifying  a  layout  to  be  fabri¬ 
cated  on  a  new  process,  or  to  make  it  conform  to  a  new  set  of  design 
rules,  is  currently  a  costly  operation.  Yet  successful  designs  seem  to  be 
more  or  less  continuously  updated  as  improved  processes  become  avail¬ 
able  during  their  lifetime.  Fig.  2  shows  two  instances  of  a  simple  layout 
produced  with  AL1.  The  instances  are  the  result  of  running  an  ALI  pro¬ 
gram  twice  changing  exactly  four  constants  in  the  program  in  between 
runs  (those  that  specified  the  sizes  of  power  and  ground  buses).  This 
type  of  flexibility  addresses  the  problem  directly.  An  ALI  program  can 
be  written  naturally  so  that  all  layouts  produced  by  it  are  completely 
free  of  design  rule  violations,  no  matter  what  the  values  of  the  con¬ 
stants  used  in  the  programs.  Therefore  the  need  for  costly  design  rule 
checking  of  different  instances  of  a  layout  (see  fig.  2)  can  be  avoided 
The  same  ALI  program  can  also  generate  layouts  using  different  design 
rules  by  running  it  with  a  new  module  incorporating  the  new  design 
rules. 

•  To  avoid  the  need  for  special  purpose  computing  equipment.  ALI  can 
be  used  effectively  from  a  standard  ASCII  terminal  in  combination  with 
a  small  plotter  shared  by  several  designers.  All  the  algorithms  used  in 
the  inner  cycle  of  ALI  require  linear  time,  therefore  permitting  the  use 
of  just  about  any  machine  and  guaranteeing  fast  turnaround  on  small 
layouts.  Furthermore  ALI  replaces  design  rule  checking  by  a  hierarchi¬ 
cal  process  that  can  be  performed  separately  on  the  individual  pieces  of 
the  layout.  For  example,  after  checking  that  each  of  the  pieces  shown 
on  the  top  of  fig.  1  is  free  of  design  rule  violations,  their  combination 
shown  on  the  bottom  of  the  same  figure  will  be  guaranteed  by  ALI  to  be 
free  of  rule  violations  regardless  of  the  stretch  that  takes  place  as  a 
consequence  of  connecting  them.  ALI  in  fact  requires  far  fewer  comput¬ 
ing  resources  than  many  design  rule  checking  programs. 

We  feel  that  ALI  succeeds  in  partially  solving  most  of  these  problem 
We  do  not  claim  however  to  have  made  the  layout  task  trivial.  To  use  a 
software  metaphor,  we  feel  that  ALI  elevates  the  work  of  the  layout 
designer  from  absolute  machine  language  programming,  to  programming 
in  a  relocatable  assembler  with  subroutines.  This  not  only  makes  the 
task  more  pleasant  but  makes  new  and  more  powerful  tools  possible 
such  as  loaders,  linkers  and  compilers  in  the  case  of  software.  Similar 
tools  for  the  VLSI  world  —  which  would  indeed  simplify  the  layout  task 
enormously  —  remain,  however,  to  be  written.  ALI  should  stand  or  fall 
with  its  ability  to  allow  such  tools  to  be  built:  whether  we  are  right  in 
believing  that  we  have  a  framework  in  which  these  tools  can  be  more 
easily  implemented  will  not  be  known  until  our  efforts  in  that  direction 
succeed  or  fail. 

The  remainder  of  this  section  is  devoted  to  a  survey  of  the  main 
features  of  ALI  and  a  brief  discussion  of  its  current  implementation 
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2.1.  An  overview  of  ALI 

The  basic  principles  of  ALI  are  quite  simple.  A  layout  is  regarded  as 
a  collection  of  rectangular  objects  (with  their  sides  oriented  in  the  direc¬ 
tion  of  the  axes  of  a  cartesian  coordinate  system)  and  a  set  of  relations 
among  these  rectangles.  The  ALI  user  specifies  a  layout  by  declaring  the 
rectangles  (also  called  boxes )  of  which  it  is  composed,  and  stating  the 
relations  that  hold  between  them.  ALI  then  generates  a  minimum  area 
layout  that  satisfies  all  the  relations  between  boxes  specified  in  the  pro¬ 
gram.  For  example,  fig.  3  shows  a  trivial  ALI  program  and  the  layout  it 
produces. 


drip  nmplc; 


hnumber  =  10; 
length  =  20; 
width  =  6; 

baatype 

htype  =  array  [ 1..  hnumber]  at  metal; 


i ;  integer; 
bos 

horizontal ;  htype; 
vertical :  metal; 

begin 

fori  ;=  1  to  hnumber- 1  do  begin 

above  ( horizontal[i],  horizontal  [i+j] ); 
glueright  ( harizontal[i],  vertical ); 
zmore  ( horizontal  [i],  length  ) 
end; 

glueright  (  horizontal[hnumber] ,  vertical ); 
zmore  (  Aerixontal[Amtmbrr],  length  ); 
zmore  (vertical,  width ) 

end 


ng.3 

A  simple  ALI  program  and  the  layout  it  produces 


This  program  looks  very  much  like  a  Pascal  program:  it  consists  of  a 
declarative  part,  followed  by  an  executable  part.  To  declare  a  box  the 
user  specifies  its  name  ( horizontal  or  vertical  in  the  example),  and  it s 
type,  {metal  —  a  predefined  type  -  in  the  example).  The  standard  box 
types  correspond  to  the  layers  of  the  physical  layout.  As  the  example 
also  shows,  the  AU  user  can  define  structured  objects  (an  array  in  the 
example).  Further  details  on  the  type  structure  of  ALI  can  be  found  in 
section  2.2.1. 

The  relations  between  the  rectangles  that  make  up  a  layout  are 
specified  in  ALI  through  calls  to  a  small  set  of  primitive  operations  in  the 
executable  part.  All  such  operations  take  as  arguments  boxes  and  possi¬ 
bly  values  of  standard  Pascal  types  (integers  in  our  example).  In  our 
example  above,  glueright  and  zmore  are  primitive  operations.  The  primi¬ 
tive  above  specifies  that  its  first  argument  must  appear  above  the  second 
one  in  the  final  layout,  the  primitive  glueright  extends  its  first  argument 
to  the  right  to  intersect  its  second  argument,  and  xmore  makes  the  size 


of  its  first  arguments  along  the  x  axis  at  least  as  large  as  the  value  of  the 
second  argument.  Note  that  in  this  example  ALI  has  determined  the 
minimum  separation  between  the  horizontal  elements  as  well  as  the 
minimum  sizes  of  boxes  not  specified  by  xmore  (such  as  the  height  of  the 
horizontal  metal  lines)  by  accessing  a  table  of  design  rules.  More  infor¬ 
mation  about  the  type  structure  and  the  primitive  operations  of  ALI  is 
given  in  the  next  section. 

When  an  ALI  program  is  executed  it  generates  two  kinds  of  informa¬ 
tion.  It  produces  a  set  of  linear  inequalities  involving  the  coordinates  of 
the  corners  of  the  boxes  in  the  layout  as  variables.  These  inequalities, 
which  embody  the  relations  between  the  rectangles  of  the  layout,  are 
then  solved  to  generate  the  positions  and  sizes  of  the  layout  elements.  A 
brief  description  of  the  problems  involved  in  this  step  can  be  found  in 
section  2.3.2.  The  program  also  produces  connectivity  information  about 
the  rectangles  in  the  layout,  Tliis  information  can  then  be  used  by  a 
switch  level  simulator  that  predicts  the  behavior  of  the  circuit  as  laid  out 
whithout  having  to  perform  the  ususal  "node  extraction”  analysis. 

In  order  for  the  layouts  produced  by  an  ALI  program  to  be  free  of 
design  rules,  the  program  must  be  complete,  in  that  every  pair  of  rectan¬ 
gles  in  it  must  be  related  in  some  way.  Two  rectangles  may  be  related 
explicitly  in  the  user  program  by  virtue  of  being  arguments  to  a  primitive 
operation,  or  they  may  be  related  through  the  transitivity  of  the  separa¬ 
tions.  The  reason  for  this  strong  requirement  is  to  prevent  the  area 
minimization  process  from  shoving  together  rectangles  that  were 
intended  to  be  separate  (see  section  2.3.3  fo  a  discussion  of  complete¬ 
ness). 

ALI  helps  the  designer  to  achieve  this  goal  by  generating  certain  rela¬ 
tions  between  layout  elements  in  an  automatic  fashion,  and  by  checking 
on  request  whether  this  condition  is  satisfied.  It  is  however  the  responsi¬ 
bility  of  the  user  to  make  an  ALI  program  complete  in  this  sense,  as  the 
computational  cost  of  doing  any  sophisticated  inference  (beyond  the 
transitivity  of  relations  such  as  above)  is  prohibitive  [16]. 


2.2.  Main  features  of  ALI 

This  section  describes  how  ALI  appears  to  its  user  Its  threp  subsec¬ 
tions  deal,  in  turn,  with  the  type  structure,  the  primitive  operations  of 
the  language  and  the  cell  mechanism.  ALI  has  been  built  on  top  cf  Pascal 
and  has  inherited  most  of  its  features.  In  the  interest  of  shortening  this 
section  we  have  assumed  a  certain  familiarity  with  the  general  features  of 
Pascal. 


2.2.1.  Type  structure 

As  the  example  of  fig.  3  shows,  the  objects  manipulated  by  ALI  are 
declared  by  stating  their  name  and  their  type.  The  types  of  ALI  have  the 
same  structure  as  the  Pascal  types.  Objects  can  be  of  a  simple  type 
(boxes)  or  of  a  structured  type. 
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There  are  a  small  number  of  standard  types,  all  of  them  simple.  The 
standard  types  correspond  to  the  layers  of  the  process  to  be  used  to 
fabricate  the  layout  (metal,  poly,  diff,  impl,  cut  and  glass  in  the  NMOS 
version  currently  implemented)  plus  the  type  virtual,  used  to  name 
bounding  boxes  and  having  no  physical  reality  in  the  fabricated  circuit. 
For  example,  in  the  program  of  fig.  2,  the  declaration 

vertical :  metal 

specifies  that  the  rectangle  named  vertical  on  the  final  layout  should  be 
on  the  metal  layer.  ALI  will  use  this  information  to  generate  constraints 
on  its  minimum  size  and  its  separation  from  other  layout  elements. 

Structured  types  are  of  two  flavors:  array  (a  collection  of  objects  of 
the  same  type)  and  bus  (a  collection  of  objects  of  heterogeneous  types, 
much  like  records  in  Pascal),  which  correspond  directly  to  the  array  and 
record  structured  types  of  Pascal.  ALI,  like  Pascal,  permits  the  creation 
of  new  user  defined  types  that  can  be  either  simple  or  structured.  For 
example,  in  fig.  3,  the  fragment 

htype  =  array  \l..hnumber)  of  metal 

inside  the  boxlype  section  of  the  program,  creates  a  new  type,  htype, 
each  object  of  that  type  made  up  of  a  number  of  metal  rectangles,  and 
the  fragment 

horizontal  :  htype 

inside  the  box  section,  creates  an  object  of  that  type  named  horizontal 

In  a  similar  fashion  the  type  declaration 

shift  bus  -  bus 

phi,  ph2 :  metal; 
vdd  :  metal; 
data  :  diff; 
gnd  :  metal 

end 

creates  a  user  defined  type,  allowing  the  user  to  create  objects  which 
consist  of  four  metal  boxes  and  a  diffusion  box.  The  types  of  the  com¬ 
ponents  of  structured  types  are  arbitrary:  the  user  can  define  arrays  of 
buses,  or  buses  containing  arrays. 

The  accessing  of  the  elements  of  arrays  and  buses  is  done  as  in  Pas¬ 
cal.  Thus  if  x  is  of  type  htype  then  z[i]  refers  to  the  i-th  element  of  z,  and 
if  y  is  of  type  shiftbus  then  y .  data  refers  to  the  diffusion  box  of  y 

Although  the  structured  objects  are  generally  used  by  ALI  simply  as  a 
naming  mechanism,  they  are  also  used  in  conjunction  with  the  cell 
mechanism  (discussed  in  section  2.2.3)  to  automatically  generate  separa¬ 
tions  between  boxes.  We  will  be  more  precise  on  this  point  when  wc 
describe  the  cell  mechanism  of  ALI. 


Like  Pascal,  AU  is  a  strongly  typed  language.  The  primitive  opera¬ 
tions  know  about  certain  type  restrictions  and  generate  type  mismatch 
errors  if  operations  are  attempted  with  rectangles  of  inappropriate 
types. 


2.2.2.  Primitive  operations 

The  relations  between  the  rectangles  that  make  up  a  layout  are 
specified  in  ALI  through  calls  to  a  small  set  of  primitive  operations.  All 
such  operations  take  boxes  (i.e.,  objects  of  simple  types)  as  arguments. 
In  the  program  of  fig.  3,  above,  glueright  and  xmore  are  primitive  opera¬ 
tions. 

It  is  not  important  to  know  the  actual  primitive  operations  of  the 
current  version  of  ALI  to  understand  its  operation.  As  a  gross  measure  of 
its  complexity  we  can  say  that  the  system  currently  implemented  - 
based  on  NMOS  as  described  in  [12]  —  has  about  twenty  primitive  opera¬ 
tions  which  can  be  arranged  in  the  following  groups: 

1  Separation  primitives:  such  as  above  in  fig.  3,  which  specify  that  their 
arguments  must  be  separated  in  a  certain  direction  in  the  final  layout. 
The  minimum  amount  of  space  between  boxes  separated  in  this  manner 
depends  on  their  types  and  is  supplied  by  ALI  from  a  table  of  design 
rules. 

2  Connection  primitives:  such  as  glueright  in  fig.  3,  to  specify  that  their 
arguments  —  which  must  be  boxes  in  the  same  layer  --  are  to  be  joined 
in  a  particular  manner. 

3  An  inclusion  primitive,  inside,  that  specifies  that  one  box  is  to  be 
placed  inside  another.  The  minimum  distances  between  their  edges  are 
again  suplied  by  ALI  from  a  table  of  design  rules. 

4  Minimum  size  primitives:  such  as  xmore  in  fig.  3,  which  specify  the 
minimum  size  of  a  box  along  a  certain  direction.  Default  minimum 
sizes  are  provided  by  ALI  from  a  design  rule  table. 

5  Transistor  primitives,  which  create  depletion  mode  and  pass  transis¬ 
tors. 

6  Contact  Primitives,  which  create  contacts  between  layers  and  connect 
boxes  to  them. 

Note  that  no  absolute  positions  or  dimensions  for  any  rectangle  can 
be  specified  with  these  primitives.  All  the  rectangles  of  a  layout  can  be 
stretched  and  compressed  (up  to  a  minimum  size)  and  all  can  float  in  any 
direction.  If  one  single  characteristic  is  to  be  used  to  separate  AU  from 
other  layout  systems,  this  must  be  it.  Most  of  the  power  of  Al.i  and  most 
of  the  problems  one  faces  in  its  implementation  are  consequences  of  this 
fact. 

It  is  important  to  remember  that  in  order  for  a  layout  produced  by 
ALI  to  be  free  of  design  rule  violations,  any  two  rectangles  in  it  must  be 
related  in  some  way.  ALI  will  make  no  inferences  as  to  the  relations 
between  boxes  beyond  those  implied  by  the  transitivity  of  some  primitive 
operations  (i.e.,  if  above  (a,  b)  and  above  (b,  c)  are  stated,  above  (a.  c) 
need  not  be  stated).  Although  the  system  generates  a  good  number  of 


relations  automatically  lor  the  user,  particularly  in  connection  with  the 
cell  mechanism  (see  the  next  subsection),  there  is  still  a  fair  amount  of 
drudgery  left  for  the  user  in  making  sure  that  this  requirement  is  met.  A 
brief  discussion  on  the  computational  complexity  of  the  automatic  gen¬ 
eration  of  relations  between  boxes  can  be  found  in  section  2.3.3. 


2.2.3.  Cells 

Perhaps  the  most  powerful  feature  of  ALI  is  its  procedure-like 
mechanism  for  the  definition  and  creation  of  cells.  A  cell  is  a  collection 
of  related  rectangles  enclosed  in  a  rectangular  area.  Rectangles  that  are 
inside  a  cell  are  of  two  types:  local  which  are  invisible  to  the  outside,  or 
parameters  which  can  interact  in  a  simple  and  well  defined  manner  with 
rectangles  outside  the  cell. 

A  cell  is  defined  by  specifying  its  local  objects,  its  formal  parameters 
and  the  relations  among  all  of  them.  Once  a  cell  has  been  defined,  it  can 
be  instantiated  as  many  times  as  desired  by  specifying  the  actual  param¬ 
eters  for  the  instance,  much  the  same  way  as  one  invokes  a  procedure  or 
function  in  a  procedural  language.  The  result  of  instantiating  a  cell  is  to 
create  a  brand  new  copy  of  the  prototype  described  in  the  cell  definition 
with  the  formal  parameters  connected  to  the  actual  parameters. 

A  cell  definition  is  made  up  of  a  header,  in  which  the  formal  parame¬ 
ters  are  described,  a  set  of  Local  box  declarations  and  a  body  in  which  the 
relationship  between  the  parameters  and  the  local  boxes,  as  well  as  those 
among  local  boxes,  are  specified. 

The  header  describes  the  names  and  types  of  the  parameters  and  the 
side  of  the  bounding  rectangle  through  which  they  come  into  contact  with 
the  inside  of  the  cell.  The  header  of  a  cell  (using  the  type  shiftbus 
defined  in  section  2.2. 1)  and  an  instance  of  it  are  shown  in  fig  4 


call  shift  (left  l :  shiftbus;  right  r  :  shiftbus  ) 


Cells  may  have  any  number  of  parameters  on  each  of  their  four  sides. 
The  order  in  which  they  are  listed  in  the  cell  header  describes  their  rela¬ 
tive  positions.  Horizontal  parameters  (i.e.,  those  touching  the  cell  on  the 
left  or  right)  are  assumed  to  be  listed  in  top  to  bottom  order  and  vertical 
parameters  in  left  to  right  order. 

The  body  of  a  cell  is  very  much  like  an  ALI  program.  For  example,  fig. 
5  shows  a  complete  cell  definition  that  consists  of  a  variable  number  of 
shift  cell  instances  connected  sequentially  together  with  two  of  its 
instances.  Note  that  cells  are  instantiated  by  the  create  statement,  and 
that  the  parameter  list  of  the  cell  contains  both  box  parameters  and 
other  parameters  (an  integer  in  this  case)  in  separate  lists.  Note  also 
that  recursion  has  been  used  to  define  this  cell;  this  highlights  the  fact 
that  ALI  has  the  full  power  of  Pascal  at  its  disposal. 

When  an  instance  of  a  cell  is  created  it  can  be  given  a  name,  provided 
that  the  name  given  has  been  declared  as  a  rectangle  of  the  standard 
simple  type  virtval.  The  relationship  of  the  rectangle  bounding  a  newly 
created  cell  to  any  other  rectangle  of  the  layout  can  be  specified  in  the 
standard  manner  by  calls  to  the  primitive  operations.  This  is  a  vital 
feature  since  in  many  cases  (i.e.,  above,  beLow...)  stating  a  relation 
between  two  cell  instances  ct  and  c2  immediately  implies  a  relation 
between  every  pair  of  rectangles  r,  and  r8  such  that  rj  is  part  of  c ,  and  r2 
part  of  Cg. 

There  are  two  important  ways  in  which  the  cell  mechanism  helps  in 
the  automatic  generation  of  constraints  between  boxes.  When  an  object 
of  a  structured  type  is  passed  as  a  parameter  to  a  cell,  its  component 
boxet,  are  separated  from  top  to  bottom  (if  it  is  a  left  or  right  argument) 
or  from  left  to  right  (if  it  is  a  top  or  bottom  argument).  The  order  of  the 
separation  is  determined  by  applying  recursively  the  following  rules: 
array  elements  are  separated  ordered  by  their  indices  and  bus  elements 
in  the  order  in  which  they  were  specified  in  the  bus  declaration.  Thus,  in 
the  example  of  fig.  5,  the  components  of  parameter  mfrus  would  be 
separated  from  top  to  bottom.  The  second  mechanism  involves  the 
automatic  separation  of  cells  that  share  a  parameter;  thus  in  the  exam¬ 
ple  of  fig  5,  the  individual  instances  of  shift  are  separated  automatically, 
since  adjacent  instances  share  a  parameter. 

The  cell  mechanism  gives  the  ALI  user  the  ability  to  describe  layouts 
in  a  truly  hierarchical  manner.  A  proper  ALI  design,  very  much  like  a  well 
structured  program,  will  consist  of  a  hierarchy  of  cell  instances  with  only 
a  small  amount  of  information  at  a  given  level  (the  parameters  of  the  cell 
instances  at  that  level)  being  visible  from  the  immediately  higher  level. 
For  example,  the  layout  given  in  fig.  2  consists  of  four  instances  of  the 
same  cell  stacked  vertically.  That  cell  in  turn  is  defined  in  terms  of  three 
other  cells,  one  of  them  being  the  cell  shown  in  fig.  1,  which  is  in  turn 
defined  in  terms  of  three  other  cells. 

Much  of  the  power  and  generality  of  the  cell  mechanism  of  ALI  comes 
from  the  absence  of  absolute  positions  and  sizes  in  a  layout  specification. 
In  particular,  two  instances  of  the  same  cell  may  have  radically  different 
sizes  depending  on  the  actual  parameters  used  to  create  them,  as 
exemplified  by  figs.  1,  2  and  5.  We  believe  that  no  cell  mechanism  can  be 


cell  shaftragistar  (  left  tnbu*  ;  shift  bus, 
right  out&us  :  shaftbus  ) 

( tangth :  vniagar  ); 

tamp  ; shaftbus ; 
begin 

if  length.  -  1  then 

create  shaft  (  inbus,  ovtbus  ) 


dee  begin 

create  shaft  ( inbus,  tsmp  ); 

create  shaftragistar  ( tamp,  outbus  )  ( tangth  -1 ) 
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said  to  be  truly  general  u  4he  sizes  of  its  arguments  and  local  rec¬ 
tangles,  as  well  as  their  rei  e  tances  are  determined  at  the  tune  the 
cell  is  instantiated. 

There  are  some  penalties  involved  in  the  use  of  the  cell  mechanism 
In  particular,  ALI  generates  separations  between  cells  in  a  mariner  vrhicn 
is  oblivious  to  what  is  inside  them.  That  is,  the  minimum  separation 
between  cells  as  far  as  ALI  is  concerned,  is  the  maximum  of  all  the 
minimum  separations  for  two  layers  in  the  design  rules,  thus  creating  a 
certain  wastage.  We  believe  that  this  penalty  will  be  generally  a  small 
percentage  of  the  total  area  and  is  well  worth  the  advantages  gained  by 


the  ability  to  separate  cell  instances  as  units. 

Another  source  of  wastage  is  the  fact  that  cells  are  restricted  to  be 
bounded  by  a  rectangle,  so  the  packing  of  cells  that  have  irregular  shapes 
results  in  a  certain  amount  of  unused  space.  The  rectangular  shape  of 
the  cells  is  a  fundamental  characteristic  of  ALI:  The  introduction  of  irreg¬ 
ularly  shaped  cells  is  simply  not  possible  without  completely  redesigning 
the  language  and.  However,  the  waste  introduced  because  of  this  restric¬ 
tion  can  be  avoided  in  most  particular  cases  through  some  code 
modifications. 


2.3.  Implementation  issues 

The  previous  section  described  the  user  view  of  ALL  In  this  section 
we  discuss  briefly  some  of  the  problems  to  be  solved  when  trying  to  go 
from  an  ALI  program  to  a  layout  that  satisfies  the  relations  stated  in  it 
We  first  give  an  overall  description  of  the  system  as  currently  imple¬ 
mented,  then  discuss  the  method  used  to  assign  locations  and  sizes  to 
the  layout  elements  and  then  the  concept  of  completeness  and  how  it  is 
checked. 


2.3.1.  Overall  implementation 

The  current  version  of  our  system  has  been  implemented  as  follows. 
The  ALI  program  is  first  translated  into  standard  Pascal.  The  resulting 
Pascal  program  is  then  compiled  and  linked  with  a  precompiled  set  of 
procedures  that  implement  the  primitive  operations  and  the  resulting 
object  module  is  then  run.  The  output  of  this  program  (generated 
entirely  by  the  primitive  operations)  is  a  set  of  linear  inequalities  and 
connectivity  relations  among  the  layout  elements.  The  inequalities  are 
then  solved  to  generate  a  layout  or  examined  by  a  program  that  checks 
their  logical  completeness,  and  the  connectivity  information  can  be  used 
to  simulate  the  circuit  laid  out. 

The  design  rules  are  incorporated  as  a  table  which  is  used  by  the 
primitive  operations  to  produce  the  linear  inequalities.  Thus  changing 
the  design  rules  for  our  system  requires  only  to  change  this  table 


2.3.2.  Placement 

As  explained  above,  one  of  the  results  of  running  an  ALI  program  is  a 
set  of  linear  inequalities  that  embody  the  relations  between  the  layout 
elements.  These  inequalities  are  of  the  following  simple  form. 

Xi-xjXi  (dfcO) 

where  the  variables  are  the  coordinates  of  the  corners  of  the  boxes  that 
form  the  layout  and  the  constants  are  either  user  supplied  (as  in  the 
second  argument  of  the  zmore  primitive,  for  instance)  or  extracted  from 
the  table  of  design  rules  by  the  system  itself. 


This  set  of  inequalities  should  be  solved  so  as  to  generate  placements 
for  the  boxes  that  compose  the  layout  in  such  a  way  as  to  minimize  its 
total  area.  In  order  to  perform  this  task  efficiently,  we  require  that  no 
inequality  in  the  set  involve  both  x  and  y  coordinates.  This  restriction 
allows  us  to  minimize  the  total  area  by  minimizing  the  maximum  z  and  y 
coordinates  of  any  point  independently,  at  the  cost  of  reducing  the  range 
of  the  relations  between  boxes  that  we  can  express.  We  cannot,  for 
instance,  handle  rectangles  whose  sides  are  not  parallel  to  the  cartesian 
axes  or  express  aspect  ratios  directly. 

We  have  now  a  sufficiently  simple  problem  so  it  can  be  solved  in  time 
proportional  to  the  number  of  inequalities  in  our  set.  (All  layouts  that 
can  be  expressed  in  ALI  can  be  generated  by  a  program  that  produces  a 
constant  number  of  inequalities  per  rectangle).  This  is  done  by  a  version 
of  the  topological  sort  algorithm  [11]  applied  to  the  x  and  y  coordinates 
independently.  This  algorithm  assigns  to  each  point  the  lowest  possible 
coordinate  while  minimizing  the  largest  coordinate  of  all  points. 

The  form  of  the  inequalities  that  we  allow  is  rather  restrictive;  it  is 
sufficient  however,  to  describe  the  design  rules  given  in  [12]  for  NMOS, 
and  the  efficiency  gained  in  return  for  this  simplicity  seem  to  us  like  a 
good  tradeoff.  A  more  subtle  consequence  of  the  simplicity  of  the  ine¬ 
qualities  and  the  method  we  use  to  solve  them  is  that  undesirable 
stretching  can  occur,  since  we  have  no  way  to  specify  a  maximum  size  for 
any  object.  This  is  not  a  common  occurrence  and  the  user  can  in  all 
cases  guard  against  such  stretching  by  the  careful  selection  of  the  primi¬ 
tive  operations  used.  It  is  nonetheless  an  additional  burden  placed  on  the 
designer. 

The  choice  of  an  efficient  placement  algorithm  over  expressibility 
power  and  a  reduced  degree  of  user  convenience  has  been  quite  cons¬ 
cious  in  this  particular  case.  We  feel  that  every  reasonable  measure 
should  be  taken  to  keep  the  complexity  of  the  placement  problem  linear, 
given  that  the  size  of  layouts  is  currently  large  (107  rectangles)  and  is 
growing  fast.  Widening  the  class  of  linear  inequalities  acceptable  is 
almost  certain  to  make  linear  time  solutions  impossible  [2]. 


2.3.3.  Completeness 

AU  programs  do  not  involve  absolute  sizes  or  positions  of  boxes,  and 
are,  to  a  great  extent, independent  of  the  design  rules.  These  characteris¬ 
tics  make  it  clearly  desirable  to  insure  that  the  layout  described  by  a 
program  will  be  free  of  design  rule  violations  in  a  way  other  than  check¬ 
ing  the  finished  layout.  The  following  paragraphs  describe  a  way  of  insur¬ 
ing  freedom  from  design  rule  violations  in  a  manner  that  is  independent 
of  the  actual  design  rules  used  to  generate  the  final  placement.  The 
description  may  be  somewhat  cryptic;  the  interested  reader  is  referred 
to  [16]  for  further  details. 

A  layout  generated  by  an  AU  program  is  complete  if  for  any  two 
boxes  a  and  b  whose  types  make  it  possible  for  them  to  interact  in  the 
final  layout,  either 


(i)  a  and  b  are  explicitly  stated  to  be  in  contact  by  some  primitive 
operation,  or 

(ii)  o  and  6  are,  explicitly  or  through  the  transitivity  of  primitive  rela¬ 
tions,  stated  to  be  separated  in  either  the  x  or  the  y  direction  by  a 
minimum  amount  which  depends  on  their  types. 

From  this  definition,  it  should  be  clear  that  testing  completeness  of  a 
cell  instance  involves  computing  the  transitive  closure  of  a  graph.  There¬ 
fore  the  complexity  of  the  operation  will  be  0(ns),  where  n  is  the  number 
of  boxes  in  the  cell.  It  is  thus  not  feasible  to  test  a  large  layout  for  com¬ 
pleteness  in  a  direct  way. 

Fortunately,  completeness  can  be  checked  hierarchically.  Let  us 
look  only  at  the  objects  at  the  highest  level  of  the  hierarchy  of  boxes  that 
defines  a  layout,  i.e.,  those  boxes  (including  cell  boundaries)  defined  glo¬ 
bally  in  the  ALI  program  that  generated  the  layout.  If  these  objects  are 
related  in  a  complete  manner  and  the  cell  instances  used  at  this  level  are 
also  complete,  then  the  whole  layout  is  complete. 

Thus  one  can  check  the  completeness  of  a  layout  by  successively 
checking  cell  instances  for  completeness,  thereby  reducing  the  complex¬ 
ity  of  the  process  to  0(m3)  where  m  is  the  largest  number  of  boxes  local 
to  a  cell  instance  in  the  layout.  This  process  can  be  reduced  further  , 
since  not  every  cell  instance  needs  to  be  checked.  For  example,  if  a  cell 
is  defined  by  a  straight  line  program,  checking  one  instance  for  com¬ 
pleteness  suffices,  as  one  instance  of  the  cell  will  be  complete  if  and  only 
if  all  of  its  instances  are  [16],  The  case  of  cells  with  branches  and  itera¬ 
tion  is  not  quite  as  simple.  Yet  we  are  confident  —  and  our  experience 
tends  to  confirm  this  belief  —  that  checking  the  completeness  of  a  few 
carefully  selected  instances  of  any  cell  definition  will  be  enough  to 
guarantee  that  the  cell  definition  is  complete. 

The  end  result  is  that  completeness  has  the  flavor  of  a  static,  almost 
syntactic,  property  for  all  non  malicious  examples,  and  is  much  easier  to 
check  in  a  well  structured  layout  than  design  rule  freedom  by  the  stan¬ 
dard  means  on  the  final  layout. 

Finally,  a  word  about  the  possibility  of  taking  an  incomplete  layout 
specification  and  automatically  completing  it.  The  general  problem  of 
generating  an  optimal  completion  is  NP-Complete,  but  the  simpler  ver¬ 
sion  of  generating  any  completion  for  graphs  embedded  in  a  grid  (as  our 
layouts  are)  seems  to  be  solvable  in  0(n8)  steps.  The  question  of  how 
much  area  will  be  wasted  by  such  a  completion  algorithm  will  have  to  wait 
for  some  experimentation,  but  there  is  no  question  of  its  usefulness. 


2.4.  Experience  with  ALI 

The  current  implementation  of  ALI  has  shown  the  soundness  of  most 
of  our  original  ideas.  The  system  is  efficient  and  the  language  easy  to 
learn,  and  the  layout  it  produces  are  relatively  dense  (for  example,  an  ALI 
program  written  without  concern  for  area  optimization  produced  a  layout 
which  was  about  thirty  per  cent  larger  than  a  similar  layout  packed  by 


hand  on  a  graphics  editor.  Unfortunately,  this  evidence  has  been  gath¬ 
ered  mostly  from  people  who  had  a  hand  in  designing  or  implementing 
ALI.  Perhaps  a  more  reliable  evaluation  of  ALI,  ought  to  wait  until  a  sub¬ 
stantial  number  of  users  not  involved  in  its  design  can  give  an  informed 
opinion.  We  hope  to  obtain  this  evidence  before  long,  since  ALI  is 
currently  being  used  in  a  VLSI  design  course. 

The  fact  that  very  little  effort  was  invested  in  error  recovery  for  the 
sake  of  expediency  in  getting  a  prototype  running,  and  that  no  mechan¬ 
ism  for  integrating  separately  produced  layout  pieces  was  provided  make 
the  current  system  useful  mostly  for  teaching  purposes  and  experimen¬ 
tation.  It  must  be  emphasized  that  this  is  a  result  of  implementation 
choices,  and  not  of  any  intrinsic  limitation  on  the  approach  we  have 
taken. 

The  problems  of  the  current  system  which  we  plan  to  address  with 
the  next  version  are  the  following: 

(1)  Memory  requirements.  The  solution  of  the  system  of  linear  inequali¬ 
ties  requires  large  amounts  of  memory.  We  will  use  a  different  algo¬ 
rithm  which  is  slightly  less  efficient  in  terms  of  time  but  requires  an 
order  of  magnitude  fewer  memory  locations  for  a  typical  large  lay¬ 
out. 

(2)  Pascal  problems.  The  current  ALI  has  exactly  the  same  type  struc¬ 
ture  as  Pascal.  The  lack  of  generic  types  and  dynamic  arrays  has 
made  the  task  of  writing  general  purpose  tools  (PLA  generators, 
routers...)  inside  AU  more  difficult  than  it  ought  to  be.  The  next  ALI 
will  have  the  notions  of  generic  types  and  dynamic  arrays. 

(3)  Connecting  primitives.  Certain  objects,  such  as  contacts,  are  used 
frequently  enough  to  warrant  making  them  part  of  the  language. 

(4)  Separate  “compilation"  facilities.  Clearly,  large  layouts  will  have  to 
be  generated  in  pieces,  which  is  something  that  our  current  system 
cannot  do. 
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1.  Introduction 

The  problem  we  address  in  this  paper  is  an  embedding  problem  for  a  class 
of  graphs  which  we  call  rectilinear  graphs.  These  graphs  are  important  in  many 
VLSI  layout  problems.  In  fact,  this  problem  arose  in  the  implementation  of  ALJ 
[6.7].  a  procedural  language  for  VLSI  design  currently  under  development  at 
Princeton.  An  embedding  algorithm  can  be  used  to  automate  the  production  of 
VLSI  layouts  in  many  procedural  design  systems. 

Consider  the  following  model  for  VLSI  layout  design.  A  VLSI  layout  is 
described  hierarchically  using  cells  and  wires  that  connect  the  cells  together. 
Each  cell  C  is  enclosed  within  a  rectangle  if  (C).  and  has  four  tuples  of  pins,  one 
each  for  the  left,  top,  right,  and  bottom  of  rectangle  F(C).  Each  wire  w  is 
denoted  by  a  pair  of  pins  (ft  ft),  such  that  ft  and  pj  are  pins  of  different  cells 
and  are  of  opposite  types.  For  example,  if  ft  is  a  right  pin  then  ft  should  be  a 
left  pin.  Given  such  a  description  of  a  VLSI  layout,  our  aim  is  to  produce  an 
embedding  of  the  description  on  the  plane,  such  that  (i)  no  two  bounding  rec¬ 
tangles  touch  each  other,  (<i)  the  pins  appear  in  the  correct  order  on  the 
bounding  rectangles,  (tii)  the  wires  are  straight  and  rectilinear,  and  (fe)  no  two 
wires  cross  each  other.  Later  on,  we  can  fill  each  bounding  rectangle  R{C)  with 
the  embedding  of  the  cell  C  in  the  same  manner. 

The  restriction  that  wires  cannot  be  bent  may  seem  unrealistic,  but  this  is 
certainly  the  case  in  many  design  systems  including  AL1.  If  a  wire  has  to  be 
bent,  the  user  specifies  that  by  breaking  up  the  wire  into  several  straight  wires 
and  placing  cells  at  each  of  the  turn  points  of  the  wire.  In  ALJ,  for  example,  the 
user  can  incorporate  routing  algorithms  in  a  ALJ  program  to  determine  how  the 
wires  are  to  be  bent.  The  restriction  that  wires  cannot  cross  implies  that  we  are 
dealing  with  the  wires  on  a  single  layer.  For  a  layout  with  multiple  layers,  it  is 
elearly  necessary  that  the  wires  on  each  layer  do  not  oross. 

To  solve  the  above  embedding  problem,  it  is  enough  to  consider  only  a  sim¬ 
ple  restriction,  where  each  bounding  rectangle  has  at  most  one  pin  on  each  side. 
We  can  then  treat  the  bounding  rectangles  as  vertices  and  the  wires  as  edges, 
which  leave  in  one  of  the  four  cardinal  directions.  We  give  the  name  rvcNNfieor 
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graphs  to  such  graphs.  The  embedding  problem  of  rectilinear  graphs  is  our 
main  concern  in  this  paper. 

For  VLSI  applications,  we  need  efficient  algorithms  to  recognize  and  then 
actually  embed  rectilinear  graphs.  In  this  paper,  we  present  an  0(»)  recogni¬ 
tion  algorithm  and  an  0(n*)  embedding  algorithm,  where  n  is  the  number  of 
vertices  in  the  graph.  Thus,  a  hierarchically  described  VLSI  layout  with  cell 

instances  CX,CZ,  • • •  ,C^  can  embedded  in  time  0(2^*),  where  n,  is  the 

<«i 

number  of  pins  in  cell  instance  Q . 

An  embedding  of  a  rectilinear  graph  is  just  a  relative  placement  of  the  ver¬ 
tices  (  cells  )  on  a  rectangular  grid,  such  that  no  two  edges  cross.  Some  of  the 
relative  placement  information  is  already  present  in  the  description  of  a  rectil¬ 
inear  graph  For  example,  if  (a ,6)  is  the  rightgoing  edge  of  vertex  a,  then  a 
should  be  to  the  right  of  6.  and  a.b  should  be  on  the  same  horizontal  grid  line. 
Hence,  an  embedding  can  be  viewed  as  a  “completion"  of  the  rectilinear  graph 
description.  We  showed  in  a  different  paper  [8]  that  the  completion  problem  for 
a  slightly  more  relaxed  VLSI  layout  model  is  NP-complete.  In  light  of  this  result, 
the  results  in  this  paper  have  become  more  important. 

In  section  2.  we  present  formal  definitions  of  rectilinear  graphs  and  their 
embeddings.  In  section  3,  we  mention  some  properties  of  rectilinear  graphs.  We 
discuss  some  topological  properties  of  the  embeddings  in  section  4.  A  necessary 
and  sufficient  condition  for  biconnected  rectilinear  graphs  to  be  embeddable  is 
presented  in  section  5.  A  similar  condition  for  arbitrary  rectilinear  graphs  is  the 
main  result  in  section  6.  We  also  describe  a  0(n)  recognition  algorithm  in  this 
section.  In  section  7,  we  use  the  ideas  of  the  previous  sections  to  obtain  an 
0(ne)  embedding  algorithm.  An  important  subclass  of  rectilinear  graphs  is  dis¬ 
cussed  in  section  6.  In  section  B,  we  discuss  extensions  and  open  problems.  For 
definitions  of  graph  theoretic  terminology  used  in  this  paper,  please  refer  to 
[1.2]. 

E.  Definition  of  the  Problem 

First  we  give  a  formal  definition  of  a  rectilinear  graph. 

Definition  2.1:  ArectiNnear  graph  G  is  a  triple  (K,  E,  A),  where  V  is  the  vertex 
set,  E  is  the  edge  set,  and 

A  :VxV-»  E  U  (tj  ,  where  E  =  \L,R,D,U\ 

is  a  vertex  ordering  relation  with  the  following  properties: 
tor  every  a, 6  jc  c  V  and  X  c  E 

(<)  A((«,6))se  «♦  *a,bj*r 

(ordering  is  specified  only  between  adjacent  vertices) 

(*)  A((a ,b ))  s  i  •  A((b,a))  *  R,  A((a.b))  *  D  «.  A((6,a))  =  U, 


(tii)  X((o,6))=JT  -*  A((c ,6 ))  *  X,  v*c  t-  a  (no  overlapping  edges). 

Each  vertex  in  a  rectilinear  graph  has  degree  at  most  four,  and  each  edge 
(a.b),  as  it  goes  from  one  vertex  a  to  the  another  b ,  has  a  nonempty  label  on  it, 
which  in  the  embedding  will  indicate  the  direction  (left, right,  down,  or  up)  in 
which  the  edge  leaves  vertex  a.  There  can  be  at  most  one  edge  with  a  particular 
label  emanating  from  each  vertex.  The  undirected  graph  G(V,E)  will  be 
referred  to  as  the  underlying  graph.  Figure  2.1  (like  all  other  figures)  gives  an 
illustration  of  a  rectilinear  graph. 
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Figure  2.1 
A  rectilinear  graph 

Now  we  define  what  sort  of  an  embedding  we  are  looking  for. 

Definition  2.2:  An  embedding  of  a  rectilinear  graph  G{V,E,\)  on  a  rectangular 
grid  is  given  by  two  mappings  x ,  y :  V-»  Z  (  the  integers  )  which  are  the  x  and  y 
coordinates  respectively  of  the  vertices.  These  mappings  obey: 

1.  the  ordering;  relation.  A,  i.e.  for  all  edges  Ja  ,b  J  e  E 
*  ((a.b))  =  £  -*  y{a)  =  y(b),  x(a)>x(b), 

M (a.b))  = /?  -♦  v(°)  =  !/(*>).  *(«*)<  x(b), 

A((o,b))  =  i?  -»  x(a)  =  x(b).  V(°)>V(6). 

M(a.b»  s  U  -  x(a)  =  x(b),  y(o)<y(b). 

2-  Planarity,  no  two  edges  cross,  i.e.  for  each  pair  of  non-adjacent  edges 
Ja.b  J,  Jc  ,d  j  such  that  A((a,b))  *  R  and  A((c  ,d))  =  U,  the  condition 


r(c)«x(b)  and  jf(c)sy(o)sy(i) 


does  not  hold. 


An  embedding  of  a  rectilinear  graph  on  a  rectangular  grid  is  one  in  which 
the  vertices  are  placed  at  grid  points,  the  edges  run  along  grid  lines  in  the  direc¬ 
tions  given  by  their  labels,  and  no  two  edges  cross  each  other  except  if  they 
share  a  vertex.  We  say  that  a  rectilinear  graph  is  embeddable  if  it  has  an 
embedding.  We  will  show  in  the  next  section  that  not  all  rectilinear  graphs  are 
embeddable. 


Figure  3.2 

Two  nonembeddable  rectilinear  graphs 
whose  blconnected  components  are 

In  this  section  we  list  some  properties  of  rectilinear  graphs  and  their 
embeddings.  These  will  give  an  indication  of  why  the  problem  is  not  trivial  and 
why  it  is  different  from  other  embedding  problems,  and  in  particular,  planar 
graph  embedding  [3,5]. 

1.  Embeddability  is  a  hereditary  property.  Subgraphs  are  defined  in  the  usual 
fashion,  but  here  the  labels  of  edges  are  inherited.  This  is  obvious,  but 
worth  mentioning,  because  this  will  be  used  in  the  proofs. 

2.  If  each  connected  component  of  a  rectilinear  graph  is  embeddable  then  the 
graph  itself  is  embeddable.  So,  without  loss  of  generality  we  will  restrict 
ourselves  to  connected  rectilinear  graphs. 

3.  Rectilinear  graphs  with  nonplanar  underlying  graphs  are  clearly  not 
embeddable.  So  it  is  not  interesting  to  consider  those  graphs.  However,  not 
every  rectilinear  graph  with  a  planar  underlying  graph  is  embeddable.  In 
figure  3.1,  we  have  two  simple  cycles  which  are  not  embeddable. 
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4.  In  contrast  with  planarity,  embeddabiiity  is  not  a  property  determined  by 
the  biconnected  components.  Figure  3.2  provides  an  illustration  of  this 
fact. 

5.  This  problem  is  a  restriction  of  an  NP-eomplete  problem  [9.11].  For  each 
wire  w,  we  are  given  its  orientation  (horizontal  or  vertical),  and  a  set  Vw  of 
vertices.  The  wire  w  has  to  pass  through  each  vertex  in  the  set  V*,  (the  ver¬ 
tices  could  be  touched  in  any  order).  Then,  the  embedding  problem 
becomes  NP-complete. 

6.  If  we  relax  the  rectilinearity  of  the  edges  and  impose  only  the  cycUc  order¬ 
ing  of  the  edges  at  each  vertex,  then  there  is  an  0(  j  V|)  algorithm  [10].  The 
cyclic  orderings  automatically  determines  the  faces  of  the  embedding  (if 
one  exists).  Thus  a  embeddable  rectilinear  graph  has  a  unique  embedding  in 
this  sense. 

4.  Topological  Structure  el  Embeddings 


i 


\((abcdefgh))  =  LURDRUL 
Figure  4.1 

The  extension  of  X  to  paths. 

There  is  a  natural  way  to  extend  the  function  X  to  paths  and  cycles  in  the 
graph  as  follows.  Given  a  path  P  =  (i>o,Vi,  •  •  •  ,vt)  we  define 
X[P)  ~  X((t»o.Vi))X((vi,vg))  •  •  •  X((v{ _!,«()).  We  define  a  similar  extension  for 
cycles  where  now  vt  =  v0.  X  becomes  a  mapping  that  associates  with  each  path 
or  cycle  in  the  graph  a  string  in  E*  which  is  the  concatenation  of  labels  along  the 
path  or  cycle.  Note  that  strings  containing  RL,  DU,  LR,  UD  as  substrings  do 
not  represent  paths.  Also  the  direction  in  which  we  traverse  a  path  and  the 
starting  point  in  a  cycle  are  important.  An  example  of  this  mapping  can  be  found 
in  figure  4.3. 

Next  we  define  two  topological  actions  on  rectilinear  graphs.  These  actions 
will  simplify  a  rectilinear  graph  while  preserving  its  topological  structure.  Let  G 
be  a  rectilinear  graph 


Action  1  -  Edge  Contraction:  Let  (abed)  be  a  path  in  G  such  that  both  b  and  c 
have  degree  2.  and  X((a6cd))  =  XYX  where  X,Y  €  E.  Contract  the  edge  (6,c)  to 
the  vertex  b .  The  resulting  path  (oftd)  will  have  \((abd))  =  XX.  We  abbreviate 


Figure  4.2 

Edge  contraction  and  vertex  deletion 


this  action  by  XYX  -*  XX  (  figure  4.2(1) ). 

Action  2  -  Vertex  Deletion:  Let  (obc )  be  a  path  in  G  such  that  vertex  b  has 
degree  2,  and  X((abc))  =  XX  where  X  e  £.  Delete  the  vertex  b  and  introduce 
the  edge  (a,c).  The  resulting  edge  (a,c)  will  have  X((a,c))  =  X.  We  abbreviate 
this  action  by  XX  ■*  X  (  figure  4.2(2)  ). 

In  a  natural  way  we  can  define  inverses  for  the  above  two  actions  which  we 
will  refer  to  as  edge  expansion  and  vertex  addition  respectively. 

Lemma  4.1:  Let  G  be  a  rectilinear  graph  and  G'  be  the  graph  resulting  from  G 
by  the  application  of  a  sequence  of  the  above  four  actions  Then  G'  is  also  rectil¬ 
inear  and  moreover  G'  is  embeddable  if  and  only  if  G  is  embeddable. 

Proof:  The  proof  is  easy  and  is  left  to  the  reader.  • 


RDLDR  -  R 
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Figure  4.3 

Simplification  of  a  path  and  a  cycle 


Dsfinition  4.1:  Given  a  string  y  e  E’  representing  a  path  or  a  cycle,  the 
wimpliflsd  form  J  of  y  is  obtained  by  repeatedly  applying  the  reduction  rules 
XYX  •*  XX  and  XX  -*  X,  where  X,  Y  e  E,  until  they  cannot  be  applied  any  more. 
If  y  represents  a  cycle  then  it  is  treated  as  a  cyclic  string. 

In  figure  4.3  we  give  a  path  and  a  cycle  along  with  their  simplified  forms. 


Lemma  4.2:  Every  string  7  e  I*  has  a  unique  simplified  form. 

Proof:  The  replacement  system  defined  by  the  two  reduction  rules  have  the 
Church-Rosser  property  [B].  ■ 

Definition  4.2:  A  square  is  one  of  the  cyclic  strings  LURD  or  LDRU. 

Sometimes  we  may  distinguish  between  two  squares  by  their  starting  labels. 

Definition  4.3:  A  spiral  is  a  path  which  cannot  be  simplified.  Equivalently  a 
spiral  is  a  substring  of  (LURD)*  or  (LDRU)*. 

Lemma  4.3:  Every  path  is  embeddable. 

Proof:  Every  spiral  is  embeddable.  Since  any  path  simplifies  to  a  spiral  (by 
definition  ).  by  lemma  4.1  it  is  also  embeddable.  • 

So  it  is  the  cycles  which  make  the  problem  nontrivial.  The  following  lemma 
is  a  crucial  fact  about  cycles. 

Lemma  4.4:  A  cycle  is  embeddable  if  and  only  if  it  simplifies  to  a  square. 

Proof:  if:  A  square  is  embeddable  and  hence  by  lemma  4.1  any  cycle  which 
simplifies  to  a  square  is  also  embeddable. 

only  if:  Let  /  be  an  embeddable  cycle  and  \(/)  =  7.  By  lemma  4.1,  the 
cycle  defined  by  7  is  also  embeddable.  Let  (7 1  *  n.  Look  at  the  embedding  of  7. 
Since  it  has  no  crossings  the  embedding  is  a  simple  polygon.  Therefore  the  inte¬ 
rior  angle  of  this  polygon  sum  to  (n-2).10O°.  Since  7  is  a  spiral  all  its  interior 
angles  are  90°.  The  only  solution  to  n.  90  =  (n-2).lB0  is  n  =4.  Therefore  7  is  a 
square.  ■ 

The  proof  of  the  previous  lemma  suggests  another  useful  characterization 
of  embeddable  cycles.  Going  along  a  cycle  /  *  vxvz  ■  •  •  vnvl  in  the  counter¬ 
clockwise  direction,  let  us  denote  by  p/  (vx)  the  angle  at  vertex  uit  which  is  the 
angle  between  and  (i/j ,w<+l). 

Lemma  4.5:  A  cycle  /  =  WjWa  •  •  v*vj,n  4  is  embeddable  if  and  only  if 

?(/)  *  £?/(**)  =  (*±2)180°  . 

t*i 

Proof:  Suppose  /  is  embeddable,  then  its  embedding  is  a  simple  polygon. 
Depending  on  whether  we  sum  the  interior  angles  or  exterior  angles  we  should 
get  (n  ±2).  180°. 

To  prove  the  sufficient  part  we  show  by  induction  of  n  that  /  simplifies  to  a 
square.  The  possible  values  for  ff/(vt)  are  90,1 80, 270°.  The  basis  for  induction  is 
n  =  4.  In  this  case  the  given  sum  of  the  angles  is  either  360°  or  1080°,  which 
implies  that  each  angle  is  either  90°  or  360°  respectively.  So  /  must  be  a 
square  by  itself. 

Assume  that  the  claim  is  true  for  all  values  less  than  n  and  let  n  >  4.  If  for 


some  4,  ?/(»*)  =  180°.  then  Xfa. =  XX  We  can  apply  vertex  deletion  at 
v*  to  obtain  /' s  Then 

f  (/ ')  B  p(/)  -  9/(vi)  -  ((n-l)±2),180°.  and  by  induction  we  are  done. 

This  leaves  the  case  where  all  angles  are  either  60*  or  270*.  Since  n  >  4 
end  <f{J )  =  (n±2).180°  not  all  the  angles  can  be  equal.  Hence  there  must  be  a  * 
such  that  ^  f/(v*«).  Hence  we  have  X(vt_,v*t/t+1vJe^2)  =  XYX.  Apply 

edge  contraction  to  obtain  /'  *  Vj  •  •  •  v*  -[Wt«2  *  •  *  vnv  ,.  The  edge  contrac¬ 
tion  removed  380°  from  the  angle  sum  and  added  1B0°.  Hence 
*(/')  =  ((n-l)±2).l80*.- 

Deflnition  4.4:  A  complement  of  a  path  P  with  respect  to  a  square  a  is  any 
path  Pc in  the  graph  such  that  PPci s  a  cycle  which  simplifies  to  \(PP*)  -  a. 

lanma  4.6:  Given  a  path  P,  all  its  complements  with  respect  to  a  square  a, 
which  have  the  same  start  and  end  labels,  have  a  unique  simplified  form. 

Ph»f:  Let  A (P)  =  a  =  X{Xi  •  •  ■  X* .  Since  a  is  a  spiral  we  have  Xt  -  Xj  for 
i  =  j  (4).  Assume  that  Je  >  4  and  that  the  spiral  a  and  the  square  a  are  either 
both  clockwise  or  both  counterclockwise.  Then  o  must  be  a  substring  of  a.  Since 
a  is  a  cyclic  string  we  can  assume  that  a  =  XxXzXaX4. 

Let  Pc  be  a  complement  of  P  with  respect  to  o  and  let  A(P£)  =  p  Since 
k  >  4,  p  must  spiral  in  the  opposite  direction  to  a.  Since  both  a  and  p  are 
simplified,  ap  can  be  simplified  only  at  the  borders  between  the  two  strings. 
Write  p  =  PtPzPa,  such  that  =  a  We  are  allowed  to  shift  p3  because  ap  is  a 
cyclic  string.  Then  it  is  clear  that  0,  e  {X^-sX*  ,X^,ej  and  pa  e  \e.Xi.XxX4].  pz  is 
the  'essential  part'  of  p.  Since  ]a|  =  Jfc  and  o  =  4,  we  must  have  \p2  \  si- 4. 
From  the  possible  values  of  px  and  p3,  and  the  fact  that  p  is  a  spiral  opposite  in 
direction  to  a  ,  we  can  conclude  that  pe  =  A*_rAi_z  -  X4  We  used  k  >  4  in 
order  for  pz  not  to  be  an  empty  string.  Therefore 
P  -  \i.Xt.Xt  •  •  •  X4\tJCl,XiX4\,  which  is  unique  but  for  the  start  and 

end  labels.  The  arguments  in  the  cases  where  a  and  c  are  in  opposite  directions 
and  for  k  <  4  are  similar.  ■ 

6.  Biconnected  Rectilinear  Graphs 

In  this  section  we  discuss  an  algorithm  for  recognizing  biconnected  rectil¬ 
inear  graphs.  Note  that  the  ordering  relation  X  induces  a  cyclic  ordering  of  the 
edges  incident  at  each  vertex  v.  For  convenience  we  will  need  the  following 
definition. 

Definition  5.1:  Let  v  be  a  vertex  in  a  rectilinear  graph  G.  Define  Lc(v)  to  be  the 
cyclic  list  of  the  neighbors  of  v  in  G  in  the  counterclockwise  order. 

Using  these  lists,  we  can  define  the  essential  notion  of  a  candidate  face  of  a 
biconnected  rectilinear  graph. 


.•W-.f'. 
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CF,(«)  =  2.1, 5, 4, 3.2  and  CFt(e)  =  1.2.7.6.5.4.3.2 
F\gure  5.1 
Candidate  faces 

Definition  5.2:  Let  G  =  ( V,E,K )  be  a  biconnected  rectilinear  graph.  With  each 
edge  e  =  ft/,  >  vep  we  associate  two  lists  of  vertices  called  candidate 

faces  CF  ,(e)  and  CFz{e )  which  are  defined  as  follows. 
CF t(e)  =  vvvs,  •  •  •  .i/k,t/*+1  where  v*  ¥■  vi  for  ij  *vk+v  i  *■  j  and  t#**,  =  v* 
for  some  i,  1  <  i  <  4—1,  such  that  for  each  I,  1  <  f  <k+ 1.  w,*,  is  the  successor 
of  vf_i  in  the  cyclic  list  Lc{vt).  CFe{e)  is  similarly  defined  but  starting  with 
vs.vv 

It  is  easy  to  see  that  CF,  and  CFZ  are  uniquely  defined.  An  illustration  of 
this  definition  is  given  in  figure  5.1. 

We  now  need  a  lemma  about  biconnected  undirected  graphs.  Let  us  define 
a  biconnected  graph  to  be  minimal  if  for  every  edge  e  in  the  graph  G—e  is  not 
biconnected.  The  following  lemma  is  taken  from  [2]  and  is  stated  without  proof. 


Demina  5.1:  If  G  is  a  minimal  biconnected  graph  having  at  least  four  vertices 
then  G  contains  a  vertex  of  degree  two. 


Lemma  5.2:  In  any  biconnected  graph  G  which  is  not  a  simple  cycle,  there  is  a 
simple  path  P  »  (vi,v8),(t/8,t>8),  •  •  •  ,(wr_j,wr)l  r  fc  2,  with  the  intermediate  ver¬ 
tices  ( if  any  )  viti  l.r  all  having  degree  2,  such  that  the  graph  G'  *  G-P  is 
biconnected. 

ftnoof:  Transform  the  given  graph  G  to  another  graph  G"  by  replacing  all  paths 
of  the  form  P  ®  (wi,w8).(v8,us),  •  •  •  .(vr_|.t/r)  where  the  vertices  i  w  l.r  all 
have  degree  2,  by  the  edge  ( vx,vr ).  So  for  each  edge  e  in  G"  we  have  a 
corresponding  path  P,  in  G.  Note  that  the  degree  of  any  vertex  in  G"  is  at  least 
three.  If  G"  has  multiple  edges  between  some  two  vertices,  say  u  and  to,  then  in 
G  there  must  be  at  least  two  parallel  paths  between  u  and  tv.  Since  G  is  not  a 
simple  cycle  any  one  of  those  paths  will  serve  our  purpose.  If  G"  does  not  have 
multiple  edges  then  it  must  have  at  least  4  vertices.  By  lemma  5.1  G "  cannot  be 
minimal.  Therefore  there  is  an  edge  c  in  G"  such  that  G"—e  is  biconnected. 

t  For  convenience  we  aseume  that  V  ie  «  eet  of  integers. 


which  implies  that  G-P,  Is  also  biconnected.  ■ 

The  following  theorem  gives  a  necessary  and  sufficient  condition  for  a  bicon¬ 
nected  rectilinear  graph  to  be  embeddable. 


Figure  5.2 


Two  possible  embeddings  of  CFj(e) 


Figure  5.3 

The  two  cycles  /  x,f  e  and  the  path  P 


Theorem  5.1:  Let  G  =  ( V,E,\ )  be  an  biconnected  rectilinear  graph  with  at  least 
three  edges.  Then  G  is  embeddable  if  and  only  if  for  each  edge  e  in  the  graph 
both  the  candidate  faces  CF ]{e)  and  CFe(e)  represent  simple  embeddable 
cycles  in  the  graph  (i.e.  the  starting  and  ending  vertices  are  identical,  and  it 
simplifies  to  a  square).  Moreover,  if  the  graph  is  embeddable  each  such  distinct 
candidate  face  corresponds  to  a  face  In  the  planar  embedding. 

Proof: 

only  if:  Supposing  for  some  edge  *=(V|,Vj),  CFj(e)  is  not  a  cycle,  i.e 
CF i(e)  =  v,,v8,  •  •  •  with  v%  =  v*+l  for  some  i,  1  <  i  <  it—  1.  Suppose  G 

is  embeddable.  Look  at  the  cycle  v*.  -  -  ,vk,vk+l  in  the  embedding  Suppose 
that  the  edge  (vt-i,Vi)  is  inside  this  cycle.  There  can  be  no  other  edges 
(u.vj),  i<  j  inside  this  cycle,  otherwise  u  would  have  appeared  instead  of 
vj+ 1  in  CFj(e).  From  the  planarity  of  the  embedding,  there  dfcn  be  no  path  from 
Vi-i  to  Vi  other  than  the  edge  (vi-i.Vi).  This  contradicts  the  biconnectedness  of 
G.  The  case  where  (Vi_i,Vi)  is  outside  the  cycle  is  similar  (  both  cases  are  dep¬ 
icted  in  figure  5.2 ). 


Suppose  CF j{e)  is  a  cycle  but  is  not  embeddable.  Since  CFj(e)  is  a  sub¬ 
graph  of  G,  G  itself  cannot  be  embeddable.  Similar  arguments  holds  for  CFt(t). 

if:  The  proof  of  this  part  is  by  induction  on  the  number  of  edges.  The  basis 
for  the  induction  are  simple  embeddable  cycles,  for  which  the  claim  is  true  by 
lemma  4.4.  Assume  that  the  claim  is  true  for  any  biconnected  rectilinear  graph 
which  has  less  than  k  edges.  Let  G  be  a  biconnected  rectilinear  graph  which  is 
not  a  simple  cycle  and  which  has  k  edges.  By  lemma  5.2,  there  is  a  simple  path 
P  =  (vi.v8),(v8,va),  •  •  ■  ,{vr-i.Vr)  with  the  vertices  vit  i  *  l ,r  all  having  degree 
2,  such  that  the  graph  G*  =  G-P  is  biconnected.  and  tv  will  have  degree 
greater  than  two.  Also  assume  that  t/j  >  v8  and  e  l8  =  (v t,i;8). 

Since  all  our  candidate  faces  are  cycles,  if  an  edge  e  lies  on  a  candidate 
face  /  then  either  CFt(e)  =  /  or  CFt(a)  -  f .  So  each  edge  will  be  present  in 
exactly  two  of  these  candidate  faces.  Hence  the  path  P  will  appear  in  CF^e  18) 
and  its  reverse  path  will  appear  in  CFs(e  1Z).  Let 
/ 1  =  CFl(*a)  =vvv2.  •  ♦  •  , t/r .1*1.  •  •  •  .tt,-,!/! . 
fa  ~  CFt(alz)  =tir.vr_,,  •  •  ■  .vi.toj,  •  •  •  ,wk.vr  ,  and 
/ a**i.*i.  •  '  •  •  •  ■  Mj.vi- 

It  follows  from  the  definition  of  the  candidate  faces  / ,  and  /2  that  the  vertices 
Uj.vt,wi  appear  consecutively  in  that  order  in  Lc{vx)  and  that  u>*.vr_,.u, 
appear  similarly  in  Lc(vr)  (  see  figure  5.3  ).  Therefore  for  each  edge  in  / 1  or  /8 
which  is  not  in  P,  the  new  candidate  face  in  G'  will  be  /  9  which  is  a  simple  cycle. 

We  still  have  to  show  that  /a  is  embeddable.  Since  / ,  and  /e  are  both 
embeddable  p(/i)  =  (r+y±2).10O°  and  p(/8)  =  {r+k  ±2).  160°  .  However,  since 
/ 1  and  fg  share  the  edge  «i8  it  is  implied  by  the  definition  of  candidate  faces 
that  p(/j)  =  (r+^+2).180°  and  p(/8)  =  (r+*+2).lB0°  is  impossible.  With  a  little 
bit  of  algebraic  manipulation  we  can  show  that  f{f  3)  =  +2)  ±2).  160*.  Since 

/ s  has  3  +  k  42  vertices  by  lemma  4.5,  it  is  embeddable.  Thus  the  candidate  faces 
for  G’  are  the  same  as  those  for  G,  excepting  for  /  a  replacing  the  two  faces  / 1 
and  /  2  So  for  each  edge  in  G*  its  two  candidate  faces  are  again  simple  embedd¬ 
able  cycles.  By  Induction  hypothesis  G'  is  embeddable  and  each  distinct  candi¬ 
date  face  corresponds  to  a  face  in  its  embedding.  The  orderings  of  the  edges  at 
the  vertices  Vi  and  v,  imply  that  the  end  edges  (v,.v8)  and  (tv-i.tv)  of  the  path 
P  are  both  trying  to  go  inside  the  face  corresponding  to  /9. 

We  are  left  to  show  that  we  can  add  the  path  P  back  without  destroying 
embeddability.  Find  any  rectilinear  path  P'  in  the  face  corresponding  to  /  3  in 
the  embedding  of  G',  that  starts  and  ends  with  X((vt,t;8))  and  A((tv_,,tv)) 
respectively.  This  is  clearly  possible  although  we  may  have  to  extend  the  grid  in 
order  for  P'  to  lie  on  the  grid  lines.  P'  creates  a  face  in  the  embedding  with  the 
path  P,  -  iv u it* 2  •  •  •  Mj«i.  If  / a  is  not  the  outside  face  then 
K{PiP')  =  \(/\)  ~  *[P\P)  =  The  case  when  / 3  is  the  outside  face  is  slightly 
more  complicated.  There  are  two  such  different  paths  P’  depending  on  the  new 
outer  face  that  is  created.  However,  for  one  of  the  two  the  above  holds  and  sup¬ 
pose  this  is  the  one  we  chose.  By  definition  both  P  and  P  are  complements  of 


Pi  with  respect  to  a,  they  also  share  the  same  start  and  end  labels,  and  by 
lemma  4.8  we  have  \{P)  =  k[P').  Therefore  G'+P‘  can  be  obtained  from  G  by 
applying  a  sequence  of  the  four  topological  actions,  and  since  G'+P'  is  embedd¬ 
able,  by  lemma  4.1  G  is  also  embeddable.  It  is  easy  to  see  that  the  two  new 
faces  we  get  after  inserting  P  in  the  embedding  of  G'  correspond  to  / 1  and  /  *.  • 

The  above  theorem  leads  to  the  following  algorithm  for  recognizing  embedd¬ 
able  biconnected  rectilinear  graphs.  The  algorithm  also  outputs  the  faces  of  the 
embedding  if  the  graph  happens  to  be  embeddable. 

Algorithm  check-biconnected(G), 

begin 

if  G  is  an  edge  then  return; 
if  |£|  >  3 1  Kj  —6  then 
begin 

write (  'not  embeddable'): 
quit 

end: 

for  each  edge  e  do 

begin 

mark[e,l].=  /alse; 
mark[e  ,2]:=  false 

mid: 

for  each  edge  e  do 
for  i:=  1  to  2  do 
begin 

if  not  mark[e  ,i]  then 
begin 

/  :=  candidate-face (  e,  i  ): 
if  not  embed  — cycle  {J  )  then 
begin 

write (  'not  embeddable' ); 
quit 

end; 

for  each  edge  s'  =  (t/i.t/g)  in  f  do 
if  Vj  >  Kg  then  mark[e',l]:=  true 
else  mark[e  \2].=  true; 
output  (f) 

end 

end 

end 

Boolean  function  e mbed-cycle(f)  returns  value  true  if  /  is  an  embeddable 
cycle.  If  /  is  a  cycle  then  we  simplify  using  the  reduction  rules  and  check  if  we 
end  up  with  a  square.  This  can  be  done  in  time  linear  in  the  size  of  / .  Function 


call  candidatm-fac*  (e ,i)  returns  the  candidate  face  C/t(*)  and  the  function  can 
be  implemented  exactly  as  described  in  definition  5.2.  In  the  calls  to  this  func¬ 
tion,  each  edge  c  can  be  traversed  at  most  twice,  due  to  the  flags  mark[e  ,1]  and 
mark[e  ,2].  Therefore  the  algorithm  runs  in  time  0{  |  V|).  We  conclude  this  sec¬ 
tion  with  a  lemma  which  will  let  us  identifly  the  outer  face  in  a  rectilinear  graph. 

Ishhim  5.3:  Let  G  be  an  embeddable  biconnected  rectilinear  graph.  For  all 
interior  faces  /  in  the  embedding  of  G,  ?(/)  =  (n-2).lB0“,  and  for  the  unique 
exterior  face  /,.  p(/,)  =  (n+2).180V 

Proof:  Consider  the  embedding  of  G  The  faces  of  the  embedding  are  deter¬ 
mined  by  G,  and  are  simple  polygons  in  the  plane.  By  the  definition  of  p.  for 
every  interior  we  count  the  interior  angles,  and  for  the  exterior  face  we  count 
the  exterior  angles.  The  lemma  follows.  (  Remember  that  if  G  is  a  simple  cycle, 
the  embedding  has  two  faces  ).  • 


Figure  5.4 
Shapes  U  and  W 


Lemma  5.4:  Let  G  be  an  embeddable  biconnected  rectilinear  graph.  /,  the 
exterior  face  in  its  embedding  and  v  a  vertex  on  /, .  If  P/t(v)  =  160°  then  G 
can  be  embedded  inside  a  polygon  of  shape  U,  as  shown  in  figure  5.4a.  If 
=  270°  then  G  can  be  embedded  inside  a  polygon  of  shape  W,  as  shown  in 
figure  5.4b. 

ftroof:  Easy  and  left  to  the  reader.  • 

8.  Articulation  Vertices 

In  this  section  we  examine  the  conditions  under  which  the  embeddability  of 
the  biconnected  components  of  the  graph  implies  the  embeddability  of  the 
graph  itself.  Clearly,  this  will  depend  on  the  way  components  meet  at  articula¬ 
tion  vertices.  In  figure  3.2,  we  showed  two  examples  of  nonembeddable  rectil¬ 
inear  graphs,  each  of  which  decomposes  into  two  embeddable  biconnected  rec¬ 
tilinear  graphs. 

In  those  cases,  the  two  biconnected  components  are  not  "compatible"  at 
the  articulation  vertex.  However,  the  situation  need  not  be  so  local.  Figure  6.1 


i 
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Figure  6.1 

Decompositions  of  nonetxxbe ddable  graphs 

depicts  two  nonembeddable  graphs,  each  of  which  decomposes  into  three 
embeddable  biconnected  components,  so  that  the  components  meeting  at  each 
articulation  vertex  are  compatible.  Note  that  an  edge  is  a  (trivial)  biconnected 
component. 

If  v  is  an  articulation  vertex  in  a  graph  G,  then  its  removal  results  in 
several  connected  subgraphs  G*  of  G.  We  will  refer  to  the  subgraphs  Gi+v.  as 
the  subgraphs  meeting  at  v .  Throughout  this  section  we  will  implicitly  assume 
that  we  are  dealing  with  rectilinear  graphs  whose  biconnected  components  are 
embeddable. 

Definition  6.1:  Let  Bx  and  Bg  be  two  nontrivial  biconnected  components  of  a 
rectilinear  graph  G  that  share  an  articulation  vertex  v .  Then  Bx  and  Bg  are  said 
to  interlace  if  the  horizontal  edges  at  v  belong  to  By  and  the  vertical  edges 
belong  to  Bg.  We  also  say  that  v  is  an  interlace  vertex.  Any  articulation  vertex 
that  does  not  have  this  property  is  said  to  be  interlace -free. 


ten.ma  8.1:  A  rectilinear  graph  G  which  has  an  interlace  articulation  vertex  v  is 
not  embeddable. 

Proof:  l^t  Bx  and  Bg  be  the  two  biconnected  components  sharing  the  vertex  v . 
Since  Bx  and  Bg  are  nontrivial,  the  horizontal  edges  at  v  lie  on  a  cycle  in  Bx  and 
the  vertical  edges  lie  on  a  cycle  in  Bg.  It  is  impossible  to  draw  G  on  the  plane 
without  these  two  cycles  crossing.  ■ 

Definition  6.2:  Let  Bx  and  Bg  be  two  non  interlacing  biconnected  components  of 
G  that  share  an  articulation  vertex  v,  and  assume  By  is  nontrivial.  Then  Bg  is 
said  to  be  inside  Bx  ( or  Bx  dominates  Bg  at  v  )  if  either  (i)  v  is  not  on  the  exte¬ 
rior  face  of  Bx,  or  (ii)  edges  (v  ,u)  and  (v,u>)  at  the  vertex  v  are  on  the  exterior 
face  of  By.  and  u.w  are  consecutive  in  that  order  in  Ze(v)  (  note  that  they  are 
always  consecutive  in  Lgt(v) ).  If  neither  By  dominates  Bg  nor  Bg  dominates  By, 
then  B j  and  Bg  are  said  to  be  outside  each  other. 


The  intuition  behind  the  above  definition  is  that  in  the  embedding,  one 
biconnected  component  must  lie  wholly  inside  some  face  of  the  other  if  one 
edge  of  it  does.  This  is  due  to  the  planarity  criterion  Gearly,  if  biconnected 
components  Bx  and  Bz  that  share  an  articulation  vertex  v  dominate  each  other, 
the  graph  is  not  embeddable  ( this  is  the  case  in  figure  3.2b  ). 

Let  B\  and  Bz  be  two  biconnected  components  of  a  graph  G  that  share  an 
articulation  vertex  v,  such  that  Bx  dominates  Bz-  Let  G'  be  the  subgraph  of  G 
meeting  at  v,  that  contains  Bg  If  G  is  embeddable  then  in  any  embedding  of  G, 
all  of  G'  should  lie  inside  one  face  of  B i-  This  suggests  extending  the  relation 
"dominate"  as  follows: 

Definition  6.3:  Let  B  =  \BX,BZ,  *  *  *  ,Bm\  be  the  set  of  biconnected  components 
of  G.  We  say  that  Bi  dominates  Bj  if  there  exists  a  biconnected  component  Bk 
and  an  articulation  vertex  v,  such  that  (i)  Bk  and  Bi  share  v,  (H)  Bi  dominates 
Bk  at  v ,  and  (Hi)  Bj  and  Bk  are  both  subgraphs  of  the  same  connected  sub¬ 
graph  meeting  at  v . 

Let  us  denote  by  F(G)  the  vertex  set  of  the  graph  G  and  by  E(G)  the  edge 
set. 

Lemma  6.2:  If  in  a  rectilinear  graph  G,  there  exists  some  pair  of  biconnected 
components  Bx  and  Bz  that  dominate  each  other,  then  G  is  not  embeddable. 
Proof:  If  Bx  and  Bb  share  an  articulation  vertex  v ,  then  as  mentioned  earlier  G 
is  not  embeddable.  Suppose  that  Bx  and  Bz  are  disjoint.  Since  Bx  and  Bz  dom¬ 
inate  each  other,  there  must  be  articulation  vertices  vx,vB,  biconnected  com¬ 
ponents  Bx\Bz.  and  subgraphs  Gx,Ge,  such  that  for  i  =  1.2,  (i)  Bt  and  Bt'  share 
(H)  Bi  dominates  2?*'  at  vt,  and  (HI)  Gj  is  one  of  the  subgraphs  meeting  at 
and  contains  Bi'.  Let  us  assume  that  G  is  embeddable.  From  (i)  vB  £  V(G,). 
(u)  Gx  lies  wholly  inside  Bx  in  the  embedding,  and  (Hi)  V(Gx)r\V(Bx)  =  we 
can  conclude  that  ue  must  be  properly  inside  a  polygon  defined  by  the  face  /  x  of 
Bx  containing  vx.  Similarly  vt  should  be  properly  inside  the  polygon  defined  by  a 
face  / s  of  Bz  containing  v2.  Therefore  some  vertices  of  /2  must  lie  outside  /  x 
and  the  two  faces  must  intersect,  and  hence  G  is  not  embeddable.  ■ 

Given  a  rectilinear  graph  G,  with  set  of  biconnected  components  B  and  set 
of  articulation  vertices  A,  we  can  construct  a  tree  T  of  biconnected  components 
such  that  V(T)  -  AuB,  and  E(T)  -\(v,B)  \  v  €  A  B  e  B,  v  e  V(B)\. 

Lemma  6.3:  Let  G  be  a  rectilinear  graph  with  the  set  of  biconnected  com¬ 
ponents  B  and  tree  of  biconnected  components  T.  Let  B  be  a  leaf  in  the  tree  T 
which  is  adjacent  to  an  articulation  vertex  v  of  degree  2  in  T.  If  F  dominates  B' 
the  other  biconnected  component  adjacent  to  v  in  T,  then  B  dominates  every 
other  biconnected  component  in  B. 

Proof:  The  only  two  subgraphs  meeting  at  v  are  B  and  G-B+v  and  the  proof 
follows  from  definition  6.3.  ■ 


If  no  two  biconnected  components  dominate  each  other,  then  the  relation 
“dominate”  induces  a  partial  order  on  B.  A  nondominating  element  in  this  partial 
order  is  a  biconnected  component  which  does  not  dominate  any  biconnected 
component. 

Corollary  6.1:  If  for  a  rectilinear  graph  G,  “dominate"  is  a  partial  order,  then 
there  exists  a  nondominating  biconnected  component  which  is  a  leaf  in  the  tree 
T  of  biconnected  components. 

Proof:  Any  trivial  biconnected  component  (  which  is  just  an  edge  )  must  be  non¬ 
dominating.  If  any  vertex  in  T  (  corresponding  to  an  articulation  vertex  in  G  )  is 
adjacent  to  two  leaves,  then  either  the  two  leaves  are  nontrivial  and  not  dom¬ 
inating,  or  one  of  them  is  a  trivial  biconnected  component.  If  no  vertex  in  T  is 
adjacent  to  two  leaves,  then  all  leaves  are  adjacent  to  vertices  of  degree  2,  and 
there  are  at  least  two  such  leaves.  If  two  of  these  leaves  are  dominating,  then  by 
lemma  6.3  the  two  leaves  dominate  each  other  which  is  a  contradiction  that 
"dominate"  is  a  partial  order.  In  fact  all  of  these  leaves  must  be  nondominating.  ■ 

Theorem  6.1:  Let  G  be  a  rectilinear  graph  and  B  its  set  of  biconnected  com¬ 
ponents.  G  is  embeddable  if  and  only  if 
(i)  every  biconnected  component  B  in  B  is  embeddable, 

{ii)  every  articulation  vertex  in  G  is  interlace-free,  and 
(t« )  "dominate"  induces  a  partial  order  on  B. 

Proof:  The  necessary  part  follows  from  lemma  6. 1  and  lemma  6.2. 

The  sufficient  part  is  shown  by  induction  on  the  number  of  vertices.  The 
basis  for  induction  is  any  biconnected  rectilinear  graph.  Let  G  be  not  bicon¬ 
nected  with  |  K(G)|  =  n.  Assume  that  the  claim  is  true  for  all  smaller  graphs. 
Look  at  the  tree  T  of  biconnected  components.  By  corollary  6.1,  there  exists  a 
leaf  B  in  T  which  is  nondominating.  Let  v  be  the  articulation  vertex  shared  by  B 
and  G'  =  G—B+v,  the  rest  of  the  graph.  G  being  a  subgraph  of  G  also  satisfies 
the  conditions  of  the  claim.  By  induction  hypothesis  G'  is  embeddable.  By  condi¬ 
tion  (i),  B  is  also  embeddable.  If  B  is  a  single  edge  it  is  easy  to  add  the  edge  to 
the  embedding  of  G'.  Assume  B  is  nontrivial.  Since  B  is  nondominating,  v  must 
lie  on  the  exterior  face  /,  of  B  and  py  (v)  w  90°  (why?). 

Embed  G'  and  B  separately  and  consider  the  vertex  v  in  both  embeddings 
If  =  180°,  then  v  is  on  only  one  edge  in  G\  Add-  new  grid  lines  to  the 

embedding  of  G\  create  the  shape  U  as  shown  in  figure  6.2a,  magnify  the  embed¬ 
ding,  and  embed  B  in  the  U  as  in  lemma  5  4a.  If  f/,(v)  =  270°,  then  v  is  either 
on  just  one  edge  in  G* ,  or  on  two  perpendicular  edges  in  G’.  In  both  cases,  add 
new  grid  lines,  create  the  shape  W  and  embed  B  as  shown  in  figure  6.2b.  ■ 

Before  we  describe  an  algorithm  for  testing  embeddability,  we  need  an  algo¬ 
rithm  for  testing  whether  "dominate"  is  a  partial  order  on  the  set  of  biconnected 
components.  From  the  tree  T  of  biconnected  components,  we  construct  f  a 


Figure  6.2 

Adding  B  to  the  embedding  of  G' 

partially  directed  tree  as  follows.  Assume  that  no  biconnected  component  dom- 
inates  and  is  dominated  at  the  same  vertex.  If  so  then  "dominate"  is  not  a  par¬ 
tial  order.  Direct  edge  (v  ,B)  from  B  to  v  if  B  dominates  at  v .  Direct  edge 
(v  ,B )  from  v  to  B  if  B  is  dominated  at  v .  Leave  all  the  other  edges  undirected. 

This  partially  directed  tree  f  can  be  constructed  in  linear  time  as  follows. 
Find  the  faces  of  each  of  the  biconnected  components  using  the  algorithm 
check-biconnected  This  takes  0(  |K|)  time.  Check  for  dominations  at  each 
articulation  vertex  as  described  in  definition  6.2.  There  are  at  most  4  bicon¬ 
nected  components  at  each  articulation  vertex  and  hence  there  are  at  most  12 
(ordered)  pairs  to  be  tested  for  domination  ( in  fact  only  2  tests  are  necessary, 
how?  ).  Construct  T  by  directing  the  edges  of  T  as  described  earlier.  Note  that 
articulation  vertices  and  biconnected  components  can  be  found  in  0(|  F|)  [l]. 
For  each  vertex  *  in  f ,  denote  by  d*n(*),  d*ut(z).  and  d(x).  the  number  of 
incoming  arcs,  the  number  of  outgoing  arcs,  and  the  number  of  undirected 
edges  respectively.  The  rest  of  the  algorithm  is  given  below. 

Algorithm  check-domvnate-po(G); 

begin 

construct  f ; 

for  each  vertex  *  in  T  do 

If  rftn(z)  >  1  then 
begin 

write  (  'not  a  partial  order’  )\ 
quit 

end 

If  search  (f)  then  write  (  yes,  partial  order’ ) 
dee  write  (  ’not  a  partial  order ' ) 

end 

function  search  (f);  boolean, 

begin 

If  T  =  •  then  search true 


else  begin 

if  35  EBwitht^WsO,  d,»(J?)+<i(F)=l  then 
begin 

Let  v  be  the  neighbor  of  B, 

if  (v)+rf(v)=l  then  search. ;=  search{T-{B\) 

else  starch :  =  search ( T— \B ,v  {) 
end  else  search :  =  false 


The  above  algorithm  can  be  easily  shown  to  be  correct  using  definition  6.3 
and  corollary  6.1.  The  boolean  function  search  can  be  implemented  nonrecur- 
sively  to  run  in  linear  time  by  maintaining  a  queue  of  the  leaves  of  T. 

Given  the  biconnected  components  and  articulation  vertices,  checking  that 
the  articulation  vertices  are  interlace-free  can  be  done  in  0(|K|)  time.  Let 
check-interlace-free  be  a  procedure  that  checks  a  given  articulation  vertex  for 
interlace-freedom.  We  end  this  section  with  a  0(|F|)  algorithm  for  testing 
embeddability  of  rectilinear  graphs. 


Algorithm  check-rectilinear  (G), 

begin 

Decompose  G  into  its  biconnected  components; 

for  each  biconnected  component  B  do  check-bicormected(B). 

for  each  articulation  vertex  v  do  check-interlace-free (v), 

check-dominate-po(G) 

end 


7.  An  Embedding  Algorithm 

In  the  previous  section  we  gave  an  algorithm  for  testing  embeddability. 
This  algorithm  can  be  easily  modified  into  an  algorithm  which  gives  an  embed¬ 
ding.  However,  the  complexity  of  this  naive  algorithm  would  be  0{  |  F|s).  The  rea¬ 
soning  is  as  follows.  The  path  P'  that  we  find  in  the  proof  of  theorem  5. 1  could  be 
0(  I  Vj)  long.  For  each  topological  action  that  we  apply  on  this  path  to  transform 
it  to  the  path  P,  we  update  the  coordinates  of  the  vertices  in  the  embedding 
once.  Thus  tor  each  path  added  we  require  0(  |  V\2)  time.  There  can  be  0(  j  V\ ) 
such  paths  and  hence  the  complexity  of  the  algorithm  is  0{  |  V|a).  To  reduce  the 
complexity  to  0(|  F|)2),  we  have  to  make  sure  that  the  path  P'  is  never  longer 
(asymptotically)  than  the  path  P.  In  this  case  the  sum  of  the  lengths  of  all  such 
paths  P'  is  0{ |  V|),  and  the  0(  |  F|8)  complexity  follows.  In  the  following,  we  show 
how  we  can  always  find  such  paths,  describe  the  algorithm,  and  analyze  its  com¬ 
plexity. 


l*win»  7.1:  Let  G  be  a  planar  biconnected  multigraph  with  minimum  degree 


three.  Then  any  embedding  of  G  has  an  interior  face  of  size  at  most  five. 

Rroof:  The  dual  G*  of  G  is  also  a  planar  graph.  Since  G  has  minimum  degree  3. 
Gd  is  a  simple  graph.  Hence  G4  has  at  least  two  vertices  of  degree  <  5  [2].  G  is 
biconnected  and  hence  one  of  the  vertices  must  correspond  to  a  face  of  size  s  5. 

a 

Lemma  7.2:  Given  an  embedding  of  a  planar  biconnected  graph  G,  which  is  not  a 
cycle,  there  is  a  simple  path  P,  such  that  (i)  the  interior  vertices  of  P  all  have 
degree  2,  (ti)  the  end  vertices  of  P  have  degree  2,  ( iii )  P  appears  in  an  inte¬ 
rior  face  /  in  the  planar  embedding,  and  (tv)  5.  \P  |  &  \f  | . 

Proof:  As  in  the  proof  of  lemma  5.2,  transform  G  to  G'  by  replacing  all  paths 
with  property  (i)  and  (ii)  by  edges.  By  lemma  7.1,  G'  has  an  interior  face  /  of 
size  at  most  5.  The  longest  of  all  the  paths  in  G  corresponding  to  the  edges  of  / 
will  satisfy  conditions  (iii)  and  (tv).  ■ 

To  get  an  embedding  of  a  given  rectilinear  graph,  we  first  test  if  the  graph  is 
embeddable  and  then  apply  the  following  algorithm. 

Algorithm  embed-rectUinear(G), 
begin 

for  each  biconnected  component  B  do  embed-bicamnected  (B); 
join-the-embe  tidings, 

end 

Algorithm  embed-biconnected(B), 
begin 

get-long-path  (  P,  P,.  a  ); 
embed-biconnected  (  B  -  P  ); 
findpath-in-emb e tiding  (  P\  Px,  o  ); 
apply-actions-and-transform  ( P\  P) 

end 

Procedure  get-long-path  returns  paths  P,P t,  and  square  o,  such  that  P 
satisfies  the  conditions  of  lemma  7.2,  and  the  interior  face  /  =  PPX  simplifies  to 
o.  By  lemma  7.2  such  a  path  exists. 

Procedure  findpathnn-embedding  traces  a  path  P'  in  the  embedding  of 
B—P,  such  that  P‘  starts  and  ends  in  the  same  directions  as  P,  and  P'P\ 
simplifies  to  o.  P'  and  P  are  both  complements  of  Px  with  respect  to  the  square 
a.  Since  PP\  is  an  interior  face,  P’  can  be  obtained  by  starting  in  the  required 
direction,  then  following  the  path  Px  in  the  embedding  of  B-P,  and  ending  in 
the  required  direction  (figure  7.1).  This  will  result  in  P'  being  a  complement  of 
Pi  with  respect  to  o.  We  have  \P'  \  -  0(  |  P\  |)  =  0(  \  P  | ). 

Procedure  apply-actions-and-transform  applies  a  sequence  of  the  four 
topological  actions  to  P'  in  the  embedding  of  B-P+P'  and  transforms  it  to  P 
thus  resulting  in  a  embedding  of  B .  This  is  done  by  first  simplifying  the  path  P' 
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Figure  7.1 

Finding  the  path  P‘  in  the  embedding  of  B—P 
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Figure  7.2 

Path  addition,  simplification  and  expansion 

and  then  expanding  the  simplified  path  to  get  the  path  P  (figure  7.2).  The 
number  of  actions  applied  will  be  0(  \  P  j  + 1 P'  | )  =  0(\P\). 

Procedure  join-embeddings  takes  the  embeddings  of  the  biconnected  com¬ 
ponents  and  puts  them  together  to  get  an  embedding  for  G.  This  is  done  essen¬ 
tially  following  the  proof  of  theorem  6.1.  Find  a  nondominating  component  B. 
Recursively  embed  G‘  -  G  -  B.  Join  the  embeddings  of  B  and  G'  using  the 
shapes  U  or  1  as  shown  in  figure  6.2 

The  algorithm  can  be  shown  to  be  correct  using  the  material  developed  in 
the  previous  three  sections.  He  now  analyze  the  complexity  of  each  step  in  the 
algorithm  and  show  that  the  total  complexity  is  0{  |  F|e). 

Procedure  join-embeddings  updates  each  coordinate  at  most  once  per 
recursive  call.  The  total  number  of  calls  is  bounded  by  the  number  of  bicon¬ 
nected  components.  Hence  this  procedure  takes  0(  j  V\V  time. 

Procedure  get-long-path  can  be  implemented  to  run  in  0(|  F|)  time  each 
time  it  is  called.  Remember  that  we  can  get  the  faces  of  a  biconnected  graph 
from  the  testing  algorithm,  and  searching  all  faces  to  get  the  required  face 
takes  linear  time.  Procedure  find-path^in-embedding  takes  0(|Pj|)  =  0(|  V|) 
time.  These  two  procedures  will  be  invoked  at  most  0(|  V|)  time.  Hence  total 
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time  spent  in  these  calls  is  0(  |  VI8). 

Procedure  apply-ac tions-and-transform  applies  a  sequence  of  0(|P|) 
actions.  Each  edge  in  G  will  appear  in  only  one  such  path  P.  Hence  the  sum  of 
the  lengths  of  all  such  paths  P  is  0(|  V|).  Each  action  updates  at  most  0(|  V|) 
coordinates.  Therefore  the  time  spent  in  calls  to  this  procedure  is  0(  j  V|8). 

8.  Consistent  Rectilinear  Graphs 


Certain  rectilinear  graphs  cannot  be  drawn  on  the  grid  even  if  we  relax  the 
planarity  criterion.  Ve  say  that  a  rectilinear  graph  G(  V.r.X)  is  consistent  if  it 
can  be  drawn  on  the  grid  satisfying  the  ordering  relation  A.  In  other  words,  G  is 
consistent  if  the  set  of  equality  and  inequality  constraints  generated  in  part  1  of 
definition  2.2  is  consistent. 

The  equality  constraints  define  an  equivalence  relation  on  the  set  of  coordi¬ 
nates  of  the  vertices  of  G.  Let  us  denote  by  e  (s)  the  equivalence  class  contain¬ 
ing  the  coordinate  s.  Denote  by  /,  and  Iw  the  sets  of  x-coordinate  and  y- 
coordinate  inequality  constraints  respectively.  Construct  two  directed  graphs 
and  GyiVy.Ey)  as  follows: 

Vg  =  Je(x)  |  x  =  *(a),  a  e  VJ  and  Eg  =  I  *i  >*zel,\ 

Vv  and  Ey  are  similarly  defined. 

It  can  be  easily  shown  that  G  is  consistent  If  and  only  if  the  two  directed 
graphs  Gg  and  Gy  are  both  acyclic.  A  solution  to  the  coordinates  which  satisfies 
the  constraints  will  correspond  to  a  nonplanar  embedding  of  G  on  the  grid.  This 
can  be  obtained  by  performing  the  topological  sort  operation  [4]  on  the  two  acy¬ 
clic  digraphs.  In  fact  this  will  yield  a  solution  that  minimizes  the  area  of  the  rec¬ 
tangle  bounding  the  embedding. 

In  a  nonplanar  embedding  of  a  consistent  rectilinear  graph  on  the  grid,  all 
crossings  are  between  horizontal  edges  and  vertical  edges.  The  vertical  edges 
can  be  assigned  one  layer,  and  the  horizontal  edges  can  be  assigned  a  second 
layer.  In  other  words  the  'thickness'  [2]  of  a  consistent  rectilinear  graph  is  less 
than  or  equal  to  two.  A  generalization  of  the  rectilinear  graph  embedding  is  the 
problem  of  embedding  a  rectilinear  graph  with  layers  preassigned,  in  which  no 
two  edges  belonging  to  two  different  layers  cross.  This  remains  an  open  prob¬ 
lem. 

9.  Extensions  and  Open  Problems 

As  mentioned  in  the  previous  section,  the  embedding  problem  for  layer 
assigned  rectilinear  graphs  is  still  to  be  solved.  This  has  important  applications 
in  VLSI.  It  is  easy  to  show  that  if  a  rectilinear  graph  is  allowed  to  be  discon¬ 
nected,  then  the  optimal  area  embedding  problem  is  NP-complete  (  reduction 
from  two  dimensional  bin  packing  ).  However,  the  question  is  open  for  con¬ 
nected  rectilinear  graphs. 


On  a  more  theoretical  side,  we  can  define  'triangular',  'hexagonal'  and  other 
polygonal  graphs,  and  consider  the  embedding  problems  on  appropriate  grids. 
However,  it  seems  that  the  work  in  this  paper  does  not  generalize  easily.  This  is 
mainly  because  these  polygonal  graphs  lack  the  nice  simplification  properties  of 
rectilinear  graphs.  Geometry  seems  to  dominate  over  topology  in  these  polygon 
nal  graphs. 

Ve  conclude  with  a  note  that  our  0{  |  V\2)  embedding  algorithm  will  be 
implemented  in  AL1.  In  ALJ,  layouts  are  described  in  a  hierarchical  fashion,  and 
hence  the  algorithm  can  be  applied  hierarchically  on  a  cell  by  cell  basis.  In  this 
case  the  complexity  of  obtaining  the  embedding  is  the  sum  of  the  quadratic 
complexity  over  all  cell  instances. 
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Abstract  We  assume  that  long  wires  represent  large 
capacitive  loads,  and  investigate  tbe  effect  on  the  area  of 
a  VLSI  layout  when  drivers  are  introduced  along  many 
long  wires  in  the  layout.  We  present  a  layout  for  which 
the  introduction  of  drivers  along  long  wires  squares  the 
area  of  the  layout;  we  show,  however,  that  the  increase  in 
area  is  never  greater  than  this,  if  the  driver  can  be  laid 
out  in  a  square  region.  We  also  show  an  area-time  trade¬ 
off  for  a  single  long  wire  by  which  we  can  reduce  the  area 
of  its  driver  to  0(t*).q<l.  from  9(1).  if  we  can  tolerate  a 
delay  of  6(1 ,_*)  rather  than  6(log  I);  and  we  obtain  tight 
bounds  on  the  worst-case  area  increase  in  general  lay¬ 
outs  having  these  drivers,  using  the  Brouwer  fixed-point 
theorem.  We  also  derive  results  for  the  case  when  drivers 
are  embedded  in  rectangles  that  are  not  square,  finally, 
we  extend  tbe  use  of  our  upper-bound  technique  to  other 
layout  problems. 


1.  Introduction 

The  presence  of  long  wires  in  a  VLSI  layout  slows 
down  the  performance  of  the  circuit  due  to  signal  propa¬ 
gation  delays.  This  effect  can  be  minimized  by  using  a 
driver  at  the  bead  of  each  long  wire.  However,  the 
drivers  themselves  occupy  space,  and  the  question 
arises  whether  drivers  can  be  introduced  efficiently 
when  there  are  many  lone  wires  in  a  lavout.  For  exam¬ 
ple.  consider  the  simple  case  of  a  long  bidirectional' wire. 
A  driver  needs  to  be  introduced  at  each  and  of  tbe  wire, 
and  tbe  presence  of  each  driver  increases  the  length 
along  which  the  other  driver  has  to  drive  the  signal.  A 
large  network  with  many  long  wires  may  contain  several 
instances  of  mutual  interactions  of  this  and  other  kinds, 
and  it  is  not  clear  whether  drivers  can  be  introduced  in 
an  area-efficient  manner  under  these  conditions  (or  even 
whether  they  can  be  introduced  at  all).  This  is  the  prob¬ 
lem  we  analyze  in  this  paper. 

The  question  of  delays  along  long  wires  is  a  complex 
one,  and  no  consensus  has  been  reached  as  yet  on  the 
correct  way  of  modeling  this.  Some  papers  (e.g.,  [BKBO], 
[VBO])  assume  that  a  constant  propagation  delay  models 
the  current  situation  quite  well.  But  it  is  generally 
accapted  that  this  is  not  a  good  approximation  of  the 
physical  situation.  Mead  and  Conway  [MCBO]  and 
Chazelle  and  Monier  [CM81]  suggest  that  a  wire  is  basi¬ 
cally  a  distributed  RC-network.  and  the  delay  thus  is  pro¬ 
portional  to  the  square  of  its  length.  This  delay  can  be 
reduced  to  a  linear  delay  by  using  repeaters  along  the 
wire.  Linear  delay  is  also  tbe  asymptotic  limit  imposed 
by  the  speed  of  light.  Thompson  [T80,  TBl]  suggests  that 
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the  resistance  of  a  long  wire  can  be  ignored  and  that  the 
wire  can  be  viewed  as  a  large  capacitive  load  whose  capa¬ 
citance  is  proportional  to  its  length.  This  view  is  also 
presented  in  [MR79]  and  is  supported  by  Bilardi.  Precchi. 
and  Preparata  [BPPB1]  who  solved  the  diffusion  equation 
for  a  long  wire  and  auggest  that,  for  predicted  future 
gains  in  technology,  the  wire  can,  indeed,  be  modeled  as 
a  purely  capacitive  load.  For  such  a  load,  it  is  well-known 
that  the  delay  can  be  reduced  to  the  logarithm  of  the 
capacitance  by  introducing  a  sequence  of  drivers  which 
occupies  area  proportional  to  the  capacitance  [MCBG]. 

In  this  paper,  we  assume  that  the  wire  can  be 
modeled  as  a  purely  capacitive  load.  We  address  the  fol¬ 
lowing  question  that  arises  naturally  in  this  context,  but 
has  not  been  examined  before:  What  is  thi  tffte t  an  tho 
ores  of  a  layout  uAin  drivers  are  introduced  fo  speed  up 
signals  along  many  long  wires?  We  justify  our 
capacitive-model  assumption  by  appealing  to  the  simula¬ 
tion  results  of  [BPPB1].  Further,  if  a  long  wire  must, 
indeed,  be  modeled  as  an  RC-network.  then  tbe  introduc¬ 
tion  of  repeaters  to  reduce  tbe  delay  will  increase  the 
area  only  by  a  constant  factor.  This  is  because  such 
repeaters  are  of  constant  size,  and  a  long  wire  may  thus 
be  modeled  as  a  sequence  of  short  wires  connected  by 
nodes  of  fixed  size.  In  the  capacitive  model,  on  the  other 
hand,  the  area  occupied  by  a  driver  increases  with  the 
length  of  tbe  wire  it  drives,  and  thus,  drivers  cannot  be 
abstracted  as  constant-size  nodes.  In  fact,  we  show  that 
there  can  be  definite  area  penalties  when  there  are 
many  drivers  in  a  layout.  Our  results  are  of  particular 
significance  for  upper-bounds  in  area-time  products  for 
VLSI  layouts,  since  most  previously  derived  bounds  have 
ignored  the  delay  along  long  wires.  Either  linear  delay 
should  be  assumed  along  connecting  wires  (this  would 
represent  either  ”RC"  wires  with  repeaters,  or  "capaci¬ 
tive"  wires  without  drivers),  or  the  area  expansion 
caused  by  drivers  for  capacitive  wires  should  be  taken 
into  consideration;  alternatively,  some  intermediate 
design  for  drivers  can  be  used  from  the  spectrum  of 
designs  we  suggest  in  section  4.  But  upper-bounds 
derived  using  constant  delay  along  all  wires,  and  no  area 
expansion  for  drivers  do  not  model  the  physical  situation 
well. 

For  a  wire  of  length  1,  the  most  familiar  type  of 
driver  occupies  9(1)  area  while  reducing  the  delay  from 
6(1)  to  0(log  1).  In  this  paper,  we  present  a  layout  for 
which  the  introduction  of  such  drivers  along  long  wires 
•quarts  the  area  of  the  layout;  we  show,  however,  that 
tbe  increase  in  area  it  never  greater  than  this,  if  the 
driver  can  be  laid  out  in  a  square  region.  We  prove  tbe 
upper-bound  by  a  new  technique  that  uses  a  fixed-point 
theorem;  we  believe  tbe  proof  technique  is  important  in 
its  own  right.  We  also  show  an  area-time  trade-off  for  a 
■ingle  long  wire  by  which  we  can  reduce  tbe  area  of  its 
driver  to  9 (<f),q  <1.  from  9(1).  if  we  can  tolerate  a  delay 


of  8(1 '"V)  rather  then  8(log  l );  and  we  obtain  bound*  on 
the  wont-cate  area  increase  in  general  layouts  having 
these  drivers,  again  using  the  Brouwer  fixed-point 
theorem.  We  also  examine  the  ease  when  drivers  eannot 
be  laid  out  efficiently  in  square  regions. 

The  paper  is  organized  as  follows:  Section  2  presents 
the  VLSI  model  we  use.  In  section  3  we  present  a  graph 
on  n*  vertices  with  0(n)  long  wires  for  which  the  layout 
area  expands  from  8(n*)  to  8(n4)  after  introducing 
drivers  for  the  long  wires.  Section  4  examines  the  area- 
time  trade-off  for  drivers,  and  sections  5  and  6  use  the 
Brouwer  fixed-point  theorem  to  find  general  tight  upper- 
bounds  for  the  area  expansion  caused  by  driver*  in  arbi¬ 
trary  layouts.  Section  7  generalizes  the  technique  to 
other  layout  problems,  and  section  8  concludes  the 
paper  with  a  review  and  some  open  questions. 


r  The  Model 

We  assume  a  layout  model  similar  to  the  one  used 
by  Thompson[TBO]  and  Leiserson[L90].  The  VLSI  circuit  is 
abstracted  as  a  graph  having  bounded  vertex  degree  and 
it  is  embedded  on  a  planar  grid  subject  to  the  following 
constraints: 

1)  Etch  vertex  represents  a  processing  element  and 
occupies  a  constant  area.  Distinct  vertices  of  the  graph 
are  embedded  at  distinct  grid  intersection  points. 

2)  Edges  have  unit  width  and  are  routed  along  grid  lines 
with  the  restriction  that  no  two  edges  touch  one  another 
except  possibly  when  crossing  perpendicular  to  each 
other.  Also,  an  edge  cannot  be  routed  over  a  vertex  it 
does  not  connect.  Each  edge  represents  a  connecting 
wire.  We  will  refer  to  an  edge  as  a  si »ir*  or  a  tm . 

We  assume  that  we  are  given  such  a  layout  for  a  cir¬ 
cuit  with  certain  edges  identified  as  long  wires.  Note  that 
the  number  and  positions  of  long  wires  is  layout- 
dependent,  and  the  same  circuit  may  have  another  lay¬ 
out  with  shorter  lines.  We  shall  not  go  Into  the  question 
of  derigning  layouts  to  minimize  the  presence  of  long 
lines.  Some  work  has  been  done  on  minimizing  the  length 
of  the  longest  line  in  a  layout  for  certain  classes  of 
graphs  ([PRSBl],  [RSBl],  [BLB2]),  but,  as  we  mention  in 
section  8,  the  minimization  criterion  in  our  case  is  a 
different  one,  which  has  not  been  studied  so  far.  We  will, 
therefore,  assume  that  the  layout  is  given,  and  we  will 
Introduce  drivers  where  needed  by  making  local  expan¬ 
sions  without  distorting  the  layout  configuration. 

A  long  wire  is  a  bidirectional  element  electrically, 
but  in  a  VLSI  circuit,  it  usually  connects  active  unidirec¬ 
tional  devices,  eo  that  the  signal  always  originates  at  one 
fixed  end  of  the  wire  (which  we  will  call  the  Acad  of  the 
wire)  and  propagates  towards  the  other  end.  For  the  rest 
of  this  paper,  we  assume  that  wires  come  with  the  direc¬ 
tion  of  signal  flow  specified,  and  that  bidirectional  wires 
are  decomposed  (conceptually)  as  two  edges  connecting 
the  same  vertices,  but  having  opposite  signal  flow. 

As  mentioned  in  the  introduction,  we  assume  that 
the  line  is  purely  eapaeitive,  and  the  delay  thus  grows 
linearly  with  I.  the  length  of  the  line.  This  delay  can  be 
reduced  by  using  drivers  to  speed  up  the  signal.  Many 
different  typet  of  driver  designs  are  possible,  and  they 
differ  in  the  signal  speed-up  they  offer,  and  the  area  they 
occupy.  We  analyze  these  time-space  trade-offs  in  sec¬ 
tion  4.  For  the  present,  we  assume  the  most  familiar 
type  of  driver,  which  reduces  the  delay  to  8(Jog  f )  at  the 
expense  of  occupying  f(l)  area.  We  also  assume  that  the 
driver  can  be  laid  out  in  a  equare  of  side  AVI,  for  eome 


figure  1 

A  driver  introduced  along 
part  of  a  long  wire 

constant  k .  This  is  not  strictly  true,  since  the  driver  has 
a  transistor  whose  channel  is  8(1)  wide,  and  hence  it  is 
more  naturally  laid  out  in  a  long,  narrow  rectangle. 
Clever  design  methods,  however,  such  as  the  use  of  a 
zig-zag  poly  line  for  the  gate  in  MOS  technology,  can  be 
used  to  overcome  this  problem.  We  have  more  to  say  on 
this  point  in  section  6. 


figure  2 

Introducing  a  driver  by  making 
cuts  in  a  layout 


Given  a  long  tine  tv  of  length  I  in  a  layout,  an  area- 
sfficient  way  to  achieve  signal  speed-up  would  be  to 
introduce  a  long  skinny  driver  (the  natural  design)  along 
a  portion  of  w  (see  figure  1).  However,  VLSI  layouts  are 
dense  in  general,  and  the  above  scheme  will  not  work  if 
there  are  vertical  lines  present  in  this  region.  In  such  a 
ease,  we  make  two  "cuts"  or  "slices"  (a  notion  introduced 
by  Lsiserson  [L80])  at  the  bead  A  of  u  ,  each  *vT  wide, 
one  each  in  the  vertical  and  horizontal  directions  (figure 
2).  Any  edge  that  is  broken  by  the  cut  is  then  joined 
aeross  the  cut  by  a  straight  line.  This  forms  a  square 
region  of  ky/t  side  at  A  with  no  edges  or  vertices  from 
the  original  layout.  Wc  introduce  the  driver  in  this 
region.  Tbit  construction  does  not  disturb  the  layout 
configuration,  and  Introduces  the  driver  by  local  expan¬ 
sions.  It  increases  the  length  of  each  side  of  the  layout 
by  the  size  of  the  cut.  ft  alee  increases  by  k  VT .  f A* 
length  of  each  A  arisen!  of  wire  in  f  As  layout  which  inter¬ 
sects  (As  vertical  tins  drown  through  A  and  eocA  vertical 
win  that  Intersects  (As  A  orison!  al  Iwis  drawn  through  h 
These  other  wires  will  now  require  larger  drivers  then  the 
ones  they  would  have  needed  in  the  absence  of  a  driver 
at  A.  The  introduction  of  drivers  at  these  other  tines  will, 
tot  turn,  affect  the  lengths  of  more  lines,  and  could  even 
Ineraese  the  length  of  ui  so  that  the  area  of  its  driver 
would  hsve  to  be  revised  upward. 
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Ve  are  now  in  a  position  to  stste  the  problem  we  ere 
going  to  analyte: 

Given  a  VLSI  layout  unth  certain  edges  speeded  mt 
long  wires,  w/tof  is  tAe  increase  in  the  area  0/  Uu  layout 
in  /As  worst  case  wAsn  drivers  are  weed  to  speed  up  sig¬ 
nals  along  these  long  wires? 
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Consider  the  graph  layout  of  figure  3.  It  consists  of 
a  square  mesh  on  »*  vertices  (assume  n  even),  together 
with  n/2  long  borisontol  lines  w,.  wt,  -  -  -  ,w»/t.  If  we 
assume  the  vertices  to  be  laid  out  in  an  nan  grid  (there 
will  be  a  slight  expansion  needed  to  accommodate  the 
long  wires  along  grid  lines,  but  we  can  ignore  this  for 
“order-of-magnitude"  arguments}  and  let  m ■  n/2,  then 
w,  runs  from  vertex  (m.l)  to  vertex  (n,l).  wt  runs  from 
(m-1*)  to  (n-1,2),  ■  •  • ,  and  Wn  runs  from  (l.m)  to 
(m  +  l.m).  The  signal  goes  from  left  to  right  along  each  of 
these  of  these  lines. 
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figure  3 

A  graph  layout  with  9(n*)  area  and  ®(n)  long  lines 
of  length  6 (n)  each 


We  consider  now  the  increase  in  the  area  of  the  lay¬ 
out  when  we  introduce  drivers  on  wires  W|  to  wm.  Notice 
that  the  presence  of  the  underlying  square-mesh  struc¬ 
ture  forces  us  to  make  cuts  to  introduce  the  drivers. 
Since  n/2  drivers  are  introduced  diagonally,  and  the 
length  of  each  wire  is  6(n)  to  start  with,  the  length  of 
each  side  of  the  layout  increases  to  at  least  0(n  n l7f).  so 
that  the  area  of  the  layout  increases  from  0(n*)  to  at 
least  8{n*).  However,  this  does  not  account  for  the 
increase  in  the  lengths  of  some  wires  due  to  the  intro¬ 
duction  of  drivers  on  other  wires.  We  now  derive  a  non¬ 
linear  recurrence  relation  that  accounts  for  this  effect, 
and  use  it  to  bound  the  increase  in  area  as  ®(n4). 

Since  the  signal  travels  from  left  to  right  along  each 
of  the  long  wires,  the  length  of  W|  is  not  affected  by  the 
introduction  of  drivers  at  the  beads  of  the  other  long 
wires.  In  general,  the  length  of  w,  increases  by  exactly 
the  lengths  of  the  driver*  introduced  on  W|  to  w,.,.  If  we 
let  rf,  be  the  width  of  each  aide  of  the  driver  for  wire  w^, 
then  we  have 

d,*km 1/1 
gtsklm  +  rf,)1'1 


A.ekfm+^'d,) 

!•> 


t/s 


W  - 1  .'t1 


Let  s«  be  a  new  variable  defined  by 
S|Bm+rf,+d( 


f»i 

■Ot-t+dt 

■«*-»+*  >/stM** 

For  convenience,  define  *c*m .  Note  that  st+t  gives  the 
width  of  the  layout  from  the  rightmost  end  to  the  left 
end  of  the  <th  driver,  and  sM+m  gives  the  length  of  each 
side  of  the  layout  after  the  introduction  of  drivers  on  all 
of  the  long  wires.  We  find  bounds  for  this  value  below: 

Lomma:  There  exist  constants  k,  and  kt  such  that  for 
any  integer  m  >0. 

£|{*+m  fc  Si  *  k |t*+m  tor  <  *Vm . 

Proof  We  prove  both  sides  of  the  inequality  by  induction. 

Assume  s<k*ii,+m 

Since  s(«n,  the  assumption  holds  for  s 0  for  all  values  of 
k,.  Assume  the  result  bolds  up  to  *t.  Then, 


*,«,*  St+kViT 

Ok  ,t*+m  *k  (k  ,<*+m  )*' * 

>fc  ♦kk  }'H 

wk  ,(i + l)*+m +(fck -2fc ,  )i -fc , 

Hence,  Sf^km+k^i  +  l)*  for  k,<(fc*/4)  and  for 
sufficiently  large  i.  For  k  ,*(k1/  9),  the  result  holds  for 
all  values  oft.  So  we  have  stfc(k*/P)t*+m  for  all  tfcO. 

Assume  si«k(itvm 

Assume  the  result  bolds  up  to  Then. 


e*o»s,vfc>/sI 

pk  «t*+m  *k  (k  *<**m  ),/l 
*tk,i*vm  fk  ((k*v  l)t*),/*  for  tfcm 1/1 
Bfc,(i  +  l)*+m  -(2k*-k(fc*+l),'l<)trtt___ 
<*i(<4l)**m  for  k(k(k*/8)' (lWl  +  16/k*). 

Hence  we  have  st<(fc*/  B)(l+ Vl+l6/kl)t*+m  for 
tkVm.  For  i«Vm  ,  we  need  s>^*t jm +m .  end  since 
s«/r  <  m  »k  Vm  *Hk  jl  *k  >m ,  the  inequality  holds 
when  kfftk  VivEk  ♦2k*. 

Therefore,  Sj*kgt*vm  for  all  itVw  when 
k|»mai(  (k*/ B)(Vlvl6/T*) .  k'/T+StTSc*  ) 

The  required  result  follows  from  combining  the  two 
inequalities.* 

Thus,  the  width  of  the  layout  after  the  introduction 
of  drivers,  which  is  given  by  tm+m.  is  6(m ‘)=&(nl)  end 
the  ares  thus  grows  to  exactly  8(n4).  the  square  of  the 
initial  area.  The  width  of  the  driver  tor  u,  is  given  by: 

dtak(lTn+^d^)l/, 

*®(i ).  for  <»Vm. 

Hence  there  are  ®(n )  drivers  of  width  8(n )  in  this  lay¬ 
out.  Note  also  that  many  of  tbe  short  wires  have  their 
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length*  greatly  increased  by  the  introduction  of  drivers 
for  wires  ivt  to  win  -  For  instance,  the  wire  connecting  the 
vertex  (l.n)  to  the  vertex  (2,n)  has  its  length  increased 
from  1  grid  unit  to  9(»)  grid  units.  We  may  now  want  to 
introduce  drivers  for  these  wires,  too.  but  since  we  a 
deriving  a  lower  bound  in  this  section,  we  will  ignore  this 
point. 

The  layout  of  figure  3  is  simple  in  the  sense  that  it 
has  no  mutual  interaction:  there  do  not  exist  wires  U| 
and  Wj  such  that  the  driver  of  ui,  increases  the  length  of 
and  the  driver  of  increases  the  length  of  114.  In 
general,  we  could  have  a  more  complex  pattern  of  long 
wires  in  which  such  mutual  interactions  occur  in  many 
ways.  However,  in  section  8.  we  present  the  surprising 
result  that  the  penalty  in  area  never  becomes  worse 
than  the  square  of  the  original  area. 

4.  Area-Delay  Trade-ofT  for  a  Driver  of  a  Lnog  Wire 

When  a  large  capacitive  load  cj,  is  driven  by  an 
inverter  with  resistance  rt,  the  delay  is  the  product  rte  1 
(note  that  r,  here  refers  to  an  "average"  resistance, 
since  the  resistance  and  the  delay  vary  depending  on 
whether  the  input  is  low  or  high.  In  what  follows,  we 
make  some  other  simplifications  of  the  physical  situa¬ 
tion.  but  these  will  affect  the  constant  factor  only,  so 
that  the  "order-of-magnitude"  results  hold).  If  the  resis¬ 
tance  and  capacitance  of  a  minimum-size  inverter  are  rt 
and  ct  respectively,  then  the  basic  unit  of  delay  is  t*r,et 
and  this  represents  the  delay  when  a  minimum-size 
inverter  drives  the  gate  of  another  minimum-size 
inverter.  In  terms  of  this  unit  t .  a  minimum-size  inverter 
driving  a  load  capacitance  c*  causes  a  delay  of  cLi/ct, 
and  this  grows  linearly  with  the  load  capacitance.  When 
the  load  is  a  long  wire,  this  delay  grows  linearly  with  1, 
the  length  of  the  wire,  assuming  the  width  and  thickness 
of  the  wire  remain  fixed. 


Cl*/l  c, 


*«*/r/«e, 
*■*»»”•//«/ 1 


channel  width  of  inv.  1  «  / ,  (minimum  channel  width) 
channel  width  of  inv.  fs/t  (channel  width  of  inv.  1) 

figure  4 

Delay  on  one  stage  of  a  driver  sequence 

To  reduce  the  delay  to  more  acceptable  levels,  we 
use  a  sequence  of  drivers  with  successively  wider  chan¬ 
nels  to  drive  the  load  capacitance.  To  understand  why 
this  scheme  works,  consider  the  delay  when  one  inverter 
with  channel  width  / 1  times  the  minimum-size  inverter 
drives  the  gate  of  another  inverter  whose  channel  is  /t 
times  wider  than  that  of  the  driving  inverter  (see  figure 
4).  The  resistance  of  the  first  inverter  is  rt//t  and  the 
capacitance  of  the  second  inverter  is  /  i/Ki  end  hence 
the  delay  of  this  stage  is  /*< .  If  we  now  have  n  inverter 
stages,  starting  with  a  minimum-size  inverter  as  the  first 
stage,  a  driver  with  a  very  wide  channel  as  the  nth  stage 
to  drive  c *.  and  intermediate  stages  with  successively 
wider  channels,  then  the  total  delay  is  the  sum  of  the 
delay  through  the  n  stages.  Let  /,  be  the  ratio  of  the 
channel  width  of  the  ielst  inverter  to  that  of  the  <tb 


inverter  for  <■  1.2.  •  •  •  ,»-l.  and  let  /„  be  the  ratio  of  cL 
to  the  capacitance  c„  of  the  last  driver  (figure  S). 


channel  width 
(normalized)  l  / , 


/./*  /.-, 
(cap.  «  c„) 


£.‘ 


figure  5 

A  sequence  of  drivers  for  load  eL 

We  have  /,/,  •  •  •  and  the  total  delay  through 

the  n  stages  is  ■  •  •  ♦/»)!.  If  we  fix  each  /, 

to  be  equal  to  a  constant  /,  then  the  delay 
T-log^Ci/e, ),  and  this  is  the  minimum  delay  we  can 
obtain  up  to  a  constant  (see  [MCBO]).  However,  the  area 
occupied  by  the  drivers  in  this  case  grows  linearly  with 
ci.  as  we  show  below,  and  it  may  be  desirable  to  reduce 
this  area  penalty  at  the  expense  of  a  slightly  greater 
delay.  We  show  below  that  such  a  space-delay  trade-off 
exists.  Some  of  these  results  are  implicit  in  the  work  of 
Lin  and  Unholm  [LL75]  and  Jaeger[J75]. 

Let  the  area  occupied  by  a  minimum-size  inverter 
be  A-  Then,  an  inverter  whose  channel  is  /  times  as 
wide  as  a  minimum-size  inverter  requires  an  area  of 
about  /\ .  Hence  the  total  area  A,  occupied  by  the 
sequence  of  drivers  with  logarithmic  delay  is  given  by 

4*A.<  !♦/♦/*♦  •••♦/•“*) 

-©(/*) 

*®(Ci). 

Now,  consider  the  general  sequence  of  drivers  described 
above. -The  area  occupied  in  this  case  is 

*■*(*♦/  »♦/»/.♦  •  ♦/»/•  /*-!>• 

Assume  that  we  can  tolerate  a  delay  of  9(l*).0<q<l 
(*9((c^)f)).  Since  the  area  of  the  last  driver  dominates 
the  above  expression  for  area,  we  look  for  the  smallest 
size  for  the  last  driver  which  will  still  result  in  a  delay  of 
not  more  than  9(1*).  Since  c„* c*//,,  and  the  delay 
through  the  last  stage  is  /«< ,  the  best  we  can  do  is  to  set 
/*w9 (1*),  and  this  results  in  an  area  of  9(11'*)  for  c*. 
Now,  the  question  is  whether  we  can  accommodate  the 
ether  drivers  to  occupy  not  more  than  0ft1**)  area  and 
cause  no  greater  than  9(1*)  delay.  The  answer  to  this  is 
'yes’,  and  in  fact,  we  can  accomplish  this  in  many  ways. 
We  describe  one  method  of  doing  this,  which  minimizes 
the  constant  on  the  delay  term.  For  this,  we  make  t* 
the  load  so  that  we  need  to  drive  a  load  of  9 (/*“*).  We 
know  from  the  design  for  drivers  with  logarithmic  delay, 
that  this  can  be  done  w.th  9((l-q  )log  ()  delay  and  area 
9 (I1-*)  using  9((!og  r*/e,))  stages.  Hence  the  total  area 
for  the  n  stages  remains  0(1 ,_* )  while  the  delay  is  incre¬ 
mented  by  only  a  9(iog  1 )  term  and  hence  remains  9(1*  )■ 

We  have  thus  shown  the  following  interesting  area- 
delay  trade-off  for  the  driver  of  a  single  long  wire  of 
length  l:  For  any  jO<g<l,  u*  con  design  a  driver  with 
delay  9(1*)  and  area-delay  product  9(1)  If,  however.  we 
sued  to  reduce  the  data y  to  9(tog  1).  wht eh  it  the 
minimum  delay  achievable,  than  the  area-delay  product 
fc at  up  to  9(1  log  1). 
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6.  Ad  Upper-Bound  Proof  Technique  Using  nndfgisti 


Let  F(*,*t.  ■  ■  ■  ,*m)  be  •  mopping  from  IP  to 
defined  by 


F"(X|,  '  '  •*■«)*(/ »(*!•  '  '  !*•)'  "  ‘  •/■»(*»• 


where  each  /,  is  s  mopping  from  IP  to  k.  Then,  o 
fiztd  —point  of  F  is  t  rector  (xj.  -  •  •  ^sm)elP  such  that 


/•(*,.  •  •  •x.Wx,.  •  .x*).  La.. 

/,(*;.  *,)»*,,  for  <*1.  ,m. 


The  following  are  some  well-known  results  from  real 
analysis  (see  e.g.,  Bertie  [B64]). 


Brouwer  Fixed-Point  Theorem  Let  B> 0,  end  let  D  be  the 
hypersphere  defined  by  D  *  |xcR“:  |x  \*B\-  Then,  any 
continuous  function  with  domain  D  and  range  contained 
in  D  has  a  fixed-point. 


Pact  I  Any  real  continuous  function  on  a  bounded  closed 
set  in  HP  has  a  maximum  and  a  minimum. 


Pact  B  Tor  B>0.  the  hypersphere  defined  by  D  * 
}xeR*“:|x  \*B\  is  a  closed  bounded  set. 

We  will  need  the  following  two  definitions: 


Definition  1  Let  /(* -  -  .x*)  be  a  continuous  mapping 
from  HP  to  R  and  let  p  (x)  be  the  mapping  from  R*  to  R* 
such  tb&t  for  each  R* 0.  g(R)  gives  the  maximum  abso¬ 
lute  value  of  /  in  the  closed  hypersphere  of  radius 
Ri'm  .  We  call  g  the  bthavizr  function  of  / .  Note  that  by 
facts  1  and  2,  g  always  exists. 


Definition  2  A  function  g  (x )  from  R*  to  IP  is  svbiintar 
in  x  if  there  exists  an  raO  such  that,  for  all  x»r,  we  have 
g  (x  )sx .  We  call  r  a  breakpoint  of  the  function  g . 

We  now  prove 


Theorem  1:  Let  F  be  a  continuous  mapping  from  RP  to 
RP  given  by 

f(*|.  •  .**}*(/ 1<*|.  •*».).  ./«.(*!.  ■  •*»})• 

and  for  each  /,.  let  the  behavior  function  g,  be  eublinear 
in  x,  with  breakpoint  r%.  Then, 

1)  the  function  F  has  a  fixed-point  within  a  hypersphere 
of  radius  Ry/m  ,  where  R*m ax  r«,  and 

2)  each  component  of  the  fixed-point  is  no  greater  than 
R  in  magnitude. 


Proof  To  prove  1),  it  it  euSicient.  from  Brouwer'e 
theorem,  to  prove  that  the  range  of  F  ie  contained  on  or 
within  the  prescribed  hypersphere  when  the  domain  is 
similarly  constrained.  For  this,  it  is  sufficient  to  prove 
that,  for  each  i,  g<(R)*R,  since  this  will  guarantee  that 
the  Euclidean  length  of  the  range  is  no  greater  than 
yfmR  But  this  follows  immediately  from  definition  2, 
since  the  functions  gt  are  subUnear  with  breakpoints  r«, 
and  for  each  i.  r«*ff . 

Note  that,  by  requiring  g^R)*R  for  aacb  t,  we  are 
constraining  the  range  to  lie  within  a  dosed  hypercube 
inscribed  within  the  hypersphere  of  the  domain.  This 
additional  restriction  ensures  that  aacb  component  of 
the  fixed-point  ie  at  most  R  in  magnitude,  and  thie 
proves  2),  and  the  theorem  * 


In  our  application  to  layout  problems,  and  in  partic¬ 
ular,  to  the  problem  of  drivers,  we  will  be  interested  in  a 
special  data  of  functions  /,(x,.  -  -  •  ):R“-*R  defined 


/« (*  •  when  £bv*j)*0- 

/•>  l»I 

•0.  otherwise,  (1) 


where  k  and  4  are  positive  constants,  0<p<l,  and 
QctfqdAT.  for  some  positive  constant  K.  Usually,  is 
either  0  or  1.  For  such  functions,  we  have  the  following 
corollary  to  Theorem  1. 


Corollary  l.l:  Let  F  be  a  mapping  from  R*  to  R*  given 

Wy 

F(*\>  ■  ■  •*■*)*(/ •(*»•  •  ’  •*»«)• '  '  ./•*<*!■  ’  ’  .*»)). 

where  the  /t’e  are  given  by  (1).  Then  F  has  a  fixed-point, 
each  component  of  which  is  positive  and  has  magnitude 
no  greater  than  R.  where 

R  «max(k  ,(!,)>.  •  “-*>),  the  *,s 

being  positive  constants  independent  of  m. 


Proof  It  is  easy  to  prove  that  each  /«  bat  behavior  func¬ 
tion  gi(x)x/4(x.  -  -  -  jt).  Let  r(s(a  solution  of  the  equa¬ 
tion  A(4+£d„x)F*x).  Then,  since  pel.  p,  is  sublinear 


with  breakpoint  rt.  Hence  from  Theorem  1,  each  com¬ 
ponent  of  a  fixed-point  of  F  has  magnitude  no  more  than 
R.  where  R* max  rt. 


Cbsel  Assume  that  each  4*(£dq»»).  Then  we  require 


*<£««>>( ».y  *rt,  for  some  constant  c,<2k,  or. 


where  c'1*(c,)1/I1'*).  The  maximum 

possible  value  for  erch  Is  K.  Hence,  each  coordinnte 
of  the  fixed-point  is  guaranteed  to  have  a  value  no 

!;reeler  than  R*kKy^mF,it~f\  for  some  constant  *m4, 
that  depends  on  k  and  K). 


Che* 2  There  exist  4'*  such  that  each  4>(£a*yr*)- 


Let  4  he  the  maximum  value  of  such  4's.  Then 
£byV(B0(4),  and  hence  we  need  r,wfc,tf  tor  tome  eon- 

j»i 

atant  A,  <2*1.  Hence,  in  this  case,  each  coordinate  of  the 
fixed-point  it  guaranteed  to  have  a  value  at  most 

H«max(k4lf.km*,m»/l,-s)). 

By  combining  the  two  eases,  the  required  result  fol¬ 
lows.  Clearly,  since  the  range  of  F  lies  entirely  in  the 
positive  orthant,  each  component  of  the  fixed-point  must 
be  positive* 


Corollary  l.l:  If  at  most  r  of  the  4W  can  be  nonxero  for 
each  function  /,  in  (1),  then  each  component  of  a  fixed- 
point  of  F  is  positive  and  bar  magnitude  no  greater  than 
R.  where  ff«mex(k,  If,  .fc.lft.fc*,  ,**'<«-*»). 


Immediate  from  the  proof  of  Corollary  l.l* 


I.  General  Upper  Bounds  far  Arwa  Penalty  la  Layouts 
with  Driver* 


He  Initially  assume  that  a  driver  for  a  wire  of  length 
I  can  ba  embedded  in  a  aquara  of  aide  kl*  (p*  1/2). 
When  pal/ 2,  tbit  corresponds  to  the  standard  driver 
with  logarithmic  delay.  Sine*  w*  ar*  now  proving  upper 
bounds,  we  will  assume  alt  wires  in  the  layout  to  be  long 
This  may  ba  a  necessary  assumption,  since  a  wire  that 
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was  abort  initially  may  become  long  due  to  tbe  introduc¬ 
tion  of  drivers  on  other  wires  (as  we  taw  in  section  3). 

Now,  assume  that  the  layout  is  in  an  *tsn'  grid 
(nsft  ).  Hence  the  area  of  the  layout  before  introducing 
drivers  is  A*nn' .  We  note  the  following  points  about  long 
lines  and  their  drivers  in  this  layout: 

1)  If  each  long  line  is  l  units  long,  then  there  are  at  most 
24/ 1  long  lines  in  the  layout. 
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figure  6 

The  position  of  a  cut  for  a  driver 

2)  The  cut  perpendicular  to  the  direction  of  the  wire  is 
always  made  with  the  position  of  the  head  fixed,  and  on 
tbe  side  the  wire  extends  (figure  6).  Thus,  if  there  art 
two  wires  with  their  heads  along  the  same  vertical  (or 
horizontal)  line  and  with  opposite  signal  flow,  their 
drivers  do  not  overlap  the  same  horizontal  (or  vertical) 
region.  Since  this  simplification  at  most  doubles  the 
effect  of  drivers  at  any  single  vertical  or  horizontal  posi¬ 
tion.  the  "order-of-msgnitude"  results  remain 
unaffected.  The  cut  in  the  direction  parallel  to  the  wire 
is  made  on  the  side  of  the  wire  which  lies  on  e  clockwise 
rotation  from  the  wire  (see  again  figure  8).  With  this 
convention,  we  can  introduce  drivers  on  all  four  lines  at 
a  vertex,  if  needed. 


w, 

->- 


figure  7 

Tbe  effect  of  long  wires  with  their  with  heads  aligned 

3)  Consider  the  situation  in  figure  7.  All  of  the  lines 
,tut ,u/(  have  their  beads  along  the  vertical  line 
through  «.j.  and  the  signal  flow  on  each  wire  is  such  that 
a  cut  is  required  immediately  to  tbe  right  of  to 
accommodate  the  driver  for  each  wire.  In  such  a  ease, 
the  width  of  tbe  widest  cut  is  sufficient  to  include  all 


drivers.  A  similar  result  holds  for  horizontal  cuts.  Note 
that  this  means  that  we  need  make  at  most  two  cuts  at 
each  vertical  and  horizontal  grid  position  (tbe  factor  of 
two  arises  from  the  convention  we  adopted  in  observa¬ 
tion  2  above,  regarding  the  position  cf  the  cut).  Thus,  at 
most  2(n+n')  outs  will  be  needed  to  accommodate 
drivers  in  any  nxn’  layout.  The  vndthi  of  the  cuts  need 
to  be  determined,  and  we  bound  these  values  later  in 
this  section  to  obtain  tight  upper-bounds  on  the  worst- 
case  area  expansion  caused  by  the  introduction  of 
drivers. 

4)  If  a  wire  bends  (i.e.,  goes  both  vertical  and  horizontal), 
the  analysis  still  bolds.  We  assume,  however,  that  a  wire 
bends  at  most  a  constant  number  of  times,  and  hence  a 
wire  can  have  length  no  more  than  Kn,  for  some  con¬ 
stant  K. 

5)  If  a  wire  is  l  grid  units  long,  then  its  length  it  affected 
by  at  most  21  other  drivers.  This  worst  case  happens 
when  there  are  the  heeds  of  two  long  wires  (either  verti¬ 
cal  or  horizontal)  at  each  of  the  (—1  inner  grid  positions 
of  ui,  and  the  signal  flow  on  these  wires  is  directed  so 
that  cuts  are  required  on  both  sides  of  the  grid  position: 
and  there  is  a  long  wire  at  each  end  grid  position  of  u, . 
and  each  of  these  wires  requires  a  cut  on  the  tide 
affecting  the  length  of  wire  w .  This  situation  is  shown  in 
figure  B. 


Ill 


length  of  wire  w  *  3  units 
figure  B 

Worst-case  effect  of  other  drivers  on  the  lengtii  of  a  wire  m 
initially  k  grid  units  long 


Consider  a  general  layout  with  m  long  wires 
■  -  .in,*  of  initial  lengths  f|.i|.  ■  •  •  .1*.  As  men¬ 
tioned  earlier,  bidirectional  wires  are  treated  as  two 
separate  wires,  with  opposite  signal  flow.  Let  tbe  driver 
for  the  ftb  wire  be  embedded  In  a  square  of  side  d,.  We 
need  to  solve  for  the  4's.  <*1.2.  ,m.  Tor  this,  we 

note  that  tbe  length  of  the  ith  wire.  is  increased  to  a 
new  value  1/  by  tbe  presence  of  drivers  on  some  of  the 
other  wires,  i.e.,  we  have 

<»1.  •••.*». 

I«i 

where  6V  Is  nonzero  if  the  driver  of  wire  increases  the 
length  of  m».  and  is  0  otherwise.  Since  we  allow  a  wire  to 
bend  a  constant  number  of  times,  a  single  cut  can 
expand  a  wire  by  up  to  K  times  the  width  of  the  cut.  and 
hence  is  a  nonnegative  integer  having  value  no  more 
than  K .  Now,  the  driver  of  wt  will  increase  the  lengths  of 
all  horizontal  wires  that  intersect  the  vertical  line 
through  the  bead  of  and  all  vertical  wires  that  inter- 


Met  tbe  horizontal  Unc  through  the  heed  of  (see 
figure  9). 


6V  *  *  l.it}  *  l.itj  *0.6,j*0. 

The  eigne)  flow  can  be  in  either  direction  in 
each  of  wires  u,  low,. 

figure  9 
Determining  dv 

Hence  a  simple  preprocessing  of  tbe  layout  gives  the 
values  for  1,  -  -  ■  ,m.  Note  that  we  can  treat  both 

horizontal  and  vertical  wires  uniformly  because  of  our 
assumption  that  the  driver  occupies  a  square  region. 
Later,  we  will  need  to  modify  our  equations  when  we  con¬ 
sider  drivers  embedded  in  rectangles  that  are  not 
square.  Tbe  value  for  each  4  >■  given  by 

•m+Ewr  (2) 

Let  us  rewrite  equation  (2)  by  defining  d,  in  a  func¬ 
tional  form,  and  extending  its  domain  to  R"  as  follows: 

*»'/«(<!•  ■  <«1.  •  •  •.♦».  when 

*"  JM 

i*i 

*0  otherwise. 


become  greater  than  for  some  constant  c .  The 

maximum  width  occurs  in  tbe  case  when  i>9(n)  (since 
this  is  the  maximum  value  for  (,  by  observation  4).  and 
this  width  is  bounded  by  cn*/(>~*>.  The  worst-case  over  p 
occurs  when  p*l/2  and  in  this  case  the  width  is  0(n). 
But  we  know  from  the  example  of  eection  3  that  a  driver 
can  indeed  become  as  sride  as  8(n).  Hence  this  bound  is 
tight.  There  can  be  up  to  6(n)  wires  in  the  layout  (by 
observation  1).  and  hence  we  may  have  to  make  fi(n) 
cuts  in  each  of  the  two  directions,  so  that  tbe  layout  area 
will  increase  from  A«n*  to  B(n*)»0(4*).  This  upper 
bound  in  the  increase  in  area  is  again  matched  by  the 
lower  bound  of  section  3.  For  p<  1/2.  the  area  increases 
to  0((n*/<'-*>)t)*0(4*'<,-*)). 

If  <w6(nr).0<r<l,  then  the  width  of  each  driver  is 
0(n*»/(i-s)).  but  by  observation  1,  there  can  be  up  to 
such  lines  in  the  layout.  We  thus  have  more  than 
9(n )  drivers  in  this  case;  however,  by  observation  3.  if  we 
make  two  cuts  of  width  0(n*')  at  each  vertical  and  hor¬ 
izontal  grid  position  where  drivers  are  needed  (there  are 
at  most  4n  such  cuts),  then  we  can  accommodate  all 
drivers.  Each  side  of  the  layout  will  expand  to  at  most 
and  tbe  area  grows  to 
Since  r<l.  this  increase  is 
leas  than  the  increase  for  the  case  when  r*l. 

If  we  had  a  mixture  of  wires  of  varying  length,  the 
width  expansion  is  still  0(n1/(,",>)  by  the  same  argu¬ 
ment.  In  the  worst  case,  we  may  have  to  make  a  cut  for 
a  driver  at  each  side  of  each  vertical  and  horizontal  grid 
position.  There  are  4n  such  vertical  and  horizontal  grid 
positions,  and  the  worst  case  occurs  when  8(n)  cuts  of 
width  are  required  in  each  of  the  two  direc¬ 

tions.  We  have  thus  proved  the  following  theorem: 

Theorem  8:  If  tbe  driver  of  a  long  wire  of  length  1  can  be 
embedded  in  a  square  region  of  side  e(t*).0<pjEl/2. 
then  the  area  A  of  any  VLSI  layout  embedded  on  a  square 
grid  increases  to  at  most  D(A'/ll~*))  with  the  introduc¬ 
tion  of  such  drivers  along  long  wires. 

We  have  already  shown  that  this  bound  is  tight  when 
p  =  l/2  We  now  show  that  this  holds  for  anyp£l/2. 


Let  F{dt,  -  ■  ■  .dp, )  be  the  mapping  from  R*  to  defined 
by 

F(d„  ■  •  .dp ,)*(/, (d„  ■  .**).  •  •  •  ./m(d„  •  .dp,)). 

Then  the  solutions  of  the  system  of  equations  (2) 
correspond  exactly  to  the  fixed  points  of  the  function  F. 
i.e..  tc  the  values  of  d,.  •  -  -  .dp,  such  that 


Lemma  If  tbe  driver  of  a  long  wire  of  length  l  can  be 
embedded  in  a  square  region  of  side  B(i*).0<jt>«l/2, 
then  there  exists  a  VLSI  layout  whose  area  increases 
from  A  to  n(A,,(,**>)  when  such  drivers  are  introduced 
along  long  wires. 

Proof  We  use.  once  again,  the  layout  of  figure  3.  The 
recurrence  relation  for  *,  is  now 


/■(d,.  •  .d« ,)«(d,.  •  •  .O. 


From  the  results  of  the  previous  section,  we  can  now 
immediately  derive  upper  bounds  on  the  width  of  each 
driver.  We  first  determine  this  for  the  case  when  the  lay¬ 
out  is  in  a  square  nxn  grid.  We  later  show  that  this  is  the 
worst  ease. 

Assume  that  each  If  is  I  units  long,  and  hence,  from 
observation  5,  up  to  21  of  tbs  can  be  nonzero  for  each 
<•  Hence,  by  Corollary  1.2,  each  component  of  the  fixed- 
point  has  magnitude  no  greater  than  X .  where 

*.max(k,l*  •  •  ,  (21  P'C-r)) 

Hence  the  width  of  any  driver  in  this  layout  will  never 


with  tc*m*n/2.  As  before.  Sp,+m  gives  the  width  of 
each  side  of  the  layout  after  tbe  introduction  of  drivers. 
We  prove  by  induction  that  tm  *  Cl 
Assume  s,  a  c  •<,/l'*v4m. 

Tbe  result  bolds  for  eg  for  all  values  of  c.  Assume  that 
the  result  holds  up  to  s,.(-  Then. 

•c(«-l),'‘,-*4m+*(e(i-l),'<,**4m)»» 

hc«-l),'(,-*4m+kc^(i-l)►/<,-*, 

•e<i/f-*>((l-l/0>/<‘-*> 

+(fc/(e'-*i))(l-l/<)""-*>4m 
■et*/t,**)(l-l/t)>,(i***(lv((fc/  e,'*)“l)/<)vm. 


6- 


Since  ptfl/2,  we  have  p/(l-j>)*l,  and  bane* 
Hence, 

#,*«?<  ,/“-*>(l-l/i)(l+((*/e,-*)-l)/i)4m 
kcil/{l"*,vm  Wben  its  end  c«(4/ 3)>'<i-a). 

Vie  eleo  need  e|Sfn*£m*fec*m.  Hence, 
e,*ci,/','*,+*n  for  ell  <  end  for 
csmin(  (4/3)'/^,'*, .  4m*  ).  Set  m*l  In  the  min  expret- 
eion  to  obtein  a  value  for  e  that  la  independent  of  m . 

Tne  width  of  each  tide  of  tbit  layout  after  the  intro¬ 
duction  of  driver*  i*  s„4m*n(ml/(,"*))=f){Ti1/(,'*)).  end 
the  required  result  for  the  area  increase  follows* 

Next,  we  consider  the  cate  wben  the  driver  is 
embedded  in  a  rectangle  that  is  not  square.  In  this  cate, 
we  show  below  that,  in  the  worst  cate,  the  longer  aide  of 
the  driver  dominates  the  summation  in  equation  (2)  by 
proving  tbe  following  theorem: 

Theorem  3:  Assume  that  the  driver  for  wire  of  length 
4  can  be  embedded  in  a  rectangle  that  bat  length 
0,04,1*  along  tbe  direction  of  the  wire  end  width  bixktt{ 
ir,  the  perpendicular  direction.  If  0<*,f<l,  then  the 
worst-case  increase  in  tbe  area  A  of  a  VLSI  layout  on  a 
square  grid  is  max(9(A1'/,,'•,).9(A,/*,“,*))  when  such 
drivers  are  introduced  at  the  heads  of  long  wires. 

Proof  Since  the  driver  it  no  longer  symmetrical  in  the 
two  directions,  we  mutt  now  modify  the  equations  for  the 
/.*•  *° 

■.=/  . °m-6l &m)=*i(4  +  £ 

tZ' 

. . ,  (3) 

l*i 

where  the  domains  of  /  u  and  /*,  are  extended  to  R*“  as 
in  the  previous  case.  The  d|«'*  and  the  d^  's  are  deter¬ 
mined  as  before  (see  figure  9).  except  that  the  first  sub¬ 
script  of  1  refers  to  lines  parallel  to  wire  i,  and  that  of  2 
refers  to  lines  perpendicular  to  it.  Thus,  in  figure  0,  we 
have  the  following  values:  i|«y  =  l.  1. 

dlejsl.  6|Cj  =  0,  and  the  other  values  are  all  seres.  We 
now  require  the  fixed-point  of  the  function 

F(*\ <*■»•*! . &«)*(/  »./  It-—/  lm  ■/(!•/ tt /  Sm)» 

(where  the  arguments  on  each  fv  on  tbe  right  hand  tide 

are  a, . Om.bi . b».)  Again,  if  each  long  wire  is 

6(nr)  long  to  start  with,  we  obtain  from  Corollary  1.2, 
the  following  upper-bound  for  the  width  of  each  driver: 

ff«max(e(n»'f->),e(n'«/‘,-*>)) 

■9(n**/(,’*>).  where  p  >mat(i  ,f ). 

As  before,  the  worst  case  occurs  when  r*l,  and  this 
gives  us  a  maximum  width  of  0(n*/<’"*))  for  aach  driver 
along  its  longer  side.  We  still  may  need  B(n)  cuts  of  this 
width  in  aach  of  the  two  directions  to  introduce  tbe 
drivers,  and  this  gives  us  the  required  result  for  tbe 
maximum  increase  in  area* 

This  bound  it  once  again  tight,  and  tbe  lower  bound 
can  be  proved  using  the  layout  of  figure  3  with  n  /  2  verti¬ 
cal  long  lines  added  on  with  their  beads  along  a  diagonal. 
Note  that  in  this  case,  *  or  f  may  be  greater  than  1/2, 
and  so  the  area  blow-up  will  be  more  than  tbe  square  of 
the  original  layout.  In  particular,  if  pal,  the  above  bound 
ceases  to  bold.  (This  will  be  tbe  case  for  a  long  driver 
whose  area  is  proportional  to  the  length  of  the  wire  it 


drives,  i.a.,  the  natural  design  for  a  standard  driver.)  In 
this  case,  the  convergence  of  tbe  system  of  equations  (3) 
for  specific  layouts  depends  on  the  constant  4,.  If  A,fcl. 
then,  clearly,  drivers  cannot  be  introduced  in  any  layout 
containing  a  long  bidirectional  wire.  In  fact,  tor  any  fixed 
value  for  4,,  we  can  design  layouts  in  which  drivers  can¬ 
not  be  introduced,  as  we  show  in  the  following  theorem. 

Theorem  4:  If  the  driver  of  a  wire  of  length  1  must  be 
embedded  in  a  rectangle  whose  length  is  41,  for  some 
constant  4.  then  there  exist  layouts  in  which  drivers  can¬ 
not  be  introduced  on  all  tong  wires. 


figure  10 

A  layout  Id  prove  Theorem  4 

Proof  Consider  the  graph  of  figure  10.  It  is  the  same  as 
the  one  in  figure  3.  except  that  all  long  wires  are  b.direc- 
tional.  We  will  prove  that,  for  n»2/4,  drivers  cannot  be 
introduced  on  el)  long  lines  in  this  layout.  As  before,  let 
us  write  out  the  equation  for  each  d,.  Tor  instance, 

d*«4(lt4d|+dm»,+  ♦<*»). 

(where  m*(n/2)),  and  the  equations  for  the  other  d,'s 
can  be  written  out  by  inspection.  Since  this  is  a  system 
of  linear  equations,  this  can  be  rewritten  in  matrix  form 


where  A  is  an  nxn  matrix  of  the  coefficients  of  the  st,'s.  1 
is  a  vector  of  the  and  d  is  e  vector  of  the  d,‘s.  We 
need  e  solution  of  this  system  of  equations  with  each  d, 
nonnegative.  For  this,  we  note  that  the  matrix  A  is  an 
Jf -matrix  when  n  <(2/4 ),  and  from  the  theory  of  euch 
matrices,  it  is  simple  to  prove  that  the  system  of  equa¬ 
tions  has  o  solution  with  all  components  of  d  nonnegative 
only  when  n  <  2/4  (BP79).* 

For  particular  layouts,  we  may  still  obtain  conver- 
gence.  but  with  a  large  area  penally.  For  example,  the 
layout  of  section  3  (which  has  no  mutual  interaction 
between  drivers  and  wires)  will  require  area  n((l+4)m) 
after  the  Introduction  of  euch  drivers.  This  result  is 
easily  verified  from  the  recurrence  relation  for  *,  which, 
in  this  case,  is  ctw(l+4)*,.|. 

W*  pointed  out  in  section  4  that  tbe  driver  has  e  lest 
•tage  with  a  vary  wide  channel  that  dominates  the  area 
of  the  driver.  Thus  euch  a  driver  will,  indeed,  be  very- 
long,  and  have  constant  width.  Tbe  introduction  of  such 
a  driver  in  the  circuit  eon  be  disastrous  in  terms  of  area 
blow-up  (if  not  impotsible).  However,  it  ehould  be  potsi- 


ble  to  redesign  the  channel  to  occupy  •  more  square 
region,  possibly  with  some  penalty  in  the  total  area 
required.  Thus,  if  the  driver  originally  required  area 
0(1*).  then  this  design  might  result  in  a  driver  of  length 
0(1*)  and  width  0(1')  with  0{1**‘)>B(1»).  but  with 
CKs.f  <1  In  such  a  case,  drivers  can  always  be  intro¬ 
duced  in  any  layout  of  long  wires  with  a  polynomial  blow¬ 
up  in  area  given  by  Theorem  3  (the  degree  of  this  polyno¬ 
mial  will  be  very  high  if  s  or  t  is  close  to  1).  In  practice, 
we  expect  driver  designs  in  square  areas  with  no  extra 
area  penalty,  and  this  results  in  the  squaring  of  the  area 
in  the  worst  case. 

Finally,  we  look  into  the  case  of  a  rectangular  VLSI 
layout  in  an  nxn'  grid  («en  ).  As  before,  if  each  long 
wire  is  <  units  long,  then  the  width  of  any  driver  is 
(■-»))  by  Corollary  1.2.  By  observation  1.  the 
number  of  wires  in  the  layout  is  0(nn'/  l)*0(A/ 1'), 
where  A  is  the  area  of  the  layout  before  the  introduction 
of  drivers  If  |sO(n  ),  then  we  may  have  to  make  2n 
cuts  along  the  longer  side,  increasing  its  length  to 
0(n -is'li-a)),  and  similarly.  2n  cuts  along  the  shorter 
side,  increasing  its  length  to  0(n'  Hence  the 

area  increases  from  nn'  to  0(nn' 

If  l- O(n'),  then,  since  the  number  of  wires  in  the  layout 
is  0(nn'/l),  the  number  of  cuts  along  the  longer  side  is 
also  no  greater  than  0(nn‘  / 1 ).  so  that  the  length  of  that 
side  increases  to  0((nn'/f )l*/(,"*)).  The  increase  in 
length  of  the  shorter  side  remains  0(n .ja'l*-*)).  Hence 
the  area  increases  .  to  no  more  than 
0((nn'/f )  n'  since  l*0{n)  by 

observation  4.  Hence  the  worst  cose  area  expansion  for 
rectangular  layouts  due  to  the  introduction  of  drivers  is 
no  greater  than  that  for  square  ones 

To  summarize,  we  have  shown  that 

1)  if  the  drivers  described  in  section  4  can  be  embedded 
in  a  square  reg.on  without  extra  area  penalty,  then  in  the 
worst  case,  the  area  of  a  V151  layout  grows  quadratically 
with  the  introduction  of  drivers  for  long  wires.  This  worst 
case  occurs  when  drivers  require  area  proportional  to 
the  length  of  the  wire  they  drive  (i.e..  the  standard 
design);  when  the  layout  is  in  an  nxn  square  and  has 
0(n)  long  wires  of  length  B(n)  each;  and  when  the  intro¬ 
duction  of  drivers  requires  0(n )  cuts  of  width  0(n)  in  the 
layout  in  each  of  the  two  directions. 

2)  if  a  driver  requires  a  rectangular  embedding,  then  the 
worst  case  increase  in  area  is  0(.41/(l'*'),  where  the 
longer  side  of  a  driver  rectangle  for  a  wire  of  length  1  is 
0(1*)  grid  units  long  If  p*I,  then  there  exist  layouts  for 
which  drivers  cannot  be  introduced. 

T.  Application  bo  Other  Layout  Problems 

The  technique  developed  in  the  previous  two  sec¬ 
tions  can  be  used  in  other  related  problems  in  VLSI.  Con¬ 
sider  the  following  question: 

Given  a  VLSI  layout  what  is  tht  global  ojfaet  of  making 
many  local  spatial  expansions  that  mutually  interact’’ 

The  introduction  of  drivers  for  long  lines  is  one 
example  of  this  problem.  Another  example  is  fault- 
tclerant  computing  in  VLSI.  Von  Neumann[vN36]  has 
suggested  a  method  of  constructing  reliable  combina¬ 
torial  circuitry  using  components  that  have  a  certain 
probability  of  failure.  The  construction  requires  replica¬ 
tion  of  the  basic  components  many  times. This  method 


does  not  carry  over  directly  to  VLSI  circuits,  since  the 
connecting  wires  here  are  also  likely  to  fail,  and  the  pro¬ 
bability  of  failure  increases  with  the  length  of  the  wire. 
This  probability  can  be  reduced  by  increasing  the  width. 
It  is  generally  accepted  that  a  fault  can  occur  anywhere 
on  the  surface  of  a  silicon  wafer  with  a  constant  probabil¬ 
ity,  and  thus,  the  occurrence  of  a  fault  can  be  modeled 
as  a  random  variable  with  a  Poisson  distribution.  Under 
this  model,  it  can  be  shown  that  the  probability  of  failure 
along  a  wire  can  be  bounded  by  any  constant  c>0  by 
making  the  width  of  the  wire  proportional  to  the  log  of 
the  length.  But  if  the  width  is  increased  after  the  circuit 
is  laid  out  (and  this  is  the  only  logical  way.  since  the  the 
lengths  of  wires  in  the  layout  cannot  be  known  before  the 
circuit  is  laid  out),  then  this  will  result  in  increasing  the 
lengths  of  other  wires,  and  it  is  not  clear  what  the  global 
effect  of  many  such  local  transformations  is. 

We  analyze  the  general  problem  by  assuming  that, 
in  the  worst  case,  a  "cut"  (as  described  in  section  2) 
would  be  required  to  make  any  expansion  of  one  dimen¬ 
sion  of  an  element  in  the  layout.  Thus,  to  introduce  a 
driver  of  length  1  and  breadth  b  at  the  head  of  a  wire,  we 
would  require  a  cut  of  width  l  perpendicular  to  the  wire, 
and  a  cut  of  width  b,  parallel  to  the  wire,  both  at  the 
head  of  the  wire,  in  order  to  accommodate  the  driver.  Of 
course,  in  particular  layouts,  it  may  so  happen  that  com¬ 
ponents  are  sparse  at  this  region,  and  so  the  driver  can 
be  introduced  as  it  is  without  any  expansion  of  the  lev- 
out  But  we  are  looking  at  worst-case  situations,  and  we 
will  thus  assume  that  the  introduction  of  any  element 
can  be  achieved  only  by  making  cuts  of  the  appropriate 
widths.  Similarly,  for  wide  wires  in  fault-tolerant  com¬ 
puting,  the  width  of  a  wire  is  increased  to  w  by  making  a 
cut  of  width  v>  parallel  to  the  wire. 

Our  general  construction  is  as  follows: 

0)  Ley  out  the  circuit  (or  assume  that  the  circuit  layout 
is  given). 

1)  Number  the  spatial  elements  that  are  to  be  varied 
(usually  this  is  a  subset  of  the  lengths  and  widths  of  the 
lines,  or  the  dimensions  of  processing  elements  or 
driver's,  but  could  conceivably  include  other  objects)  as 
V|,  v*.  •  •  ■  ,vr  in  some  arbitrary  order. 

2)  For  each  spatial  element  v*.  identify  the  other  ele¬ 
ments  v,  that  are  made  to  expand  by  its  presence.  For 
each  such  element,  set  i^k,  where  v,  is  made  to 
expand  k  times  by  the  presence  of  v,. 

3)  The  following  system  of  equations  defines  the  final 
values  of  the  v,‘s: 

*.=/«(*»+£ **1.2.  ■■•.r, 

where  t,  is  a  constant  corresponding  to  the  initial  value 
of  some  linear  dimension  in  the  layout,  and  /,  deter¬ 
mines  the  functional  dependence  of  v,  on  the  v,'s.  If,  for 
each  /,,  the  behavior  function  g,  is  sublinear  with  break¬ 
point  ri,  then  an  upper  bound  for  the  v,'s  is  given  by  R. 
where 

ffrmu  rt. 

Examples: 

1)  For  drivers  embedded  in  a  square,  v,  is  the  length  of 
a  side  of  the  driver  of  the  ith  line,  and  1,  is  the  length  of 
the  ith  line,  and  /,(x)=t  x*.  for  some  constant  k .  and 
for  some  p<l. 
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2)  Tor  fault-tolerant  computing,  v,  is  the  width  of  the 
ith  line,  1*  is  its  length,  and  /,(*)**  In  x,  for  some  con¬ 
stant  k.  ■ 

8.  Conclusion 

y 

In  this  paper  we  have  analyzed  the  effect  of  intro¬ 
ducing  drivers  to  speed  up  signals  along  many  long  wires 
in  a  VLSI  layout.  We  have  shown  that,  under  all  but  the 
most  naive  of  designs  for  a  driver  (i.e.,  the  case  when 
drivers  have  constant  width,  and  occupy  area  propor¬ 
tional  to  the  length  of  the  wire  they  drive),  these  drivers 
can  be  introduced  with  only  a  polynomial  increase  in 
area.  With  the  additional  assumption  that  drivers  can  be 
embedded  in  a  square  region,  we  he ve  shown  that  the 
area  at  most  squares  by  their  introduction.  All  results 
-  have  matching  upper-  and  lower-bounds.  Wabave  also 
*-  shewn  an  area-delay  trade-off  in  the  design  of  drivers, 
and'  we  have  generalized  the  upper-bound  proof  tech¬ 
nique.  Some  open  problems  that  remain  are: 

1)  Civen  a  VLSI  circuit,  is  it  possible  to  design  a  layout 
for  it  so  that  the  presence  of  many  long  wires  is  "minim¬ 
ized"  according  to  a  suitably  defined  criterion?  Some 
work  has  been  done  on  minimizing  the  length  of  the  long¬ 
est  wire  in  a  layout  for  certain  classes  of  graphs.  [PRSBl] 
and  [RSSl]  have  derived  bounds  for  trees  and  [BL82] 
have  extended  the  result  to  classes  of  graphs  with 
separator  n,/*"',e>0  or  Vnlog*n,*»0.  Leiserson's  lay¬ 
out  technique  [L80]  minimizes  the  maximum  edgelength 
for  classes  of  graphs  with  separator  nT,  r>l/2.  Some 
lower-bound  results  are  presented  in  Leighton  [LSI). 
However,  our  results  on  drivers  indicate  that  it  is  not  the 
presence  of  a  single  long  wire  that  causes  the  worst-case 
increase  in  area,  but  rather  the  presence  of  many  long 
wires  in  a  configuration  that  requires  many  cuts.  Even 
heuristics  to  prevent  such  undesirable  configurations  of 
long  lines  should  be  very  useful. 

2)  Is  it  possible  to  incorporate  the  presence  of  drivers  in 
our  layout  model,  for  example,  by  using  nodes  of  variable 
size,  and  can  we  obtain  area-efficient  layouts  under  this 
model?  Using  such  a  model,  we  can  re-examine  the 
upper-bounds  that  have  been  obtained  for  the  layout 
area  of  many  common  circuits.  In  particular,  upper- 
bounds  for  AT°,a>0,  should  be  re-examined  under  the 
context  of  area-expansion  caused  by  drivers  of  long 
wires. 

3)  The  numerical  solution  of  equation  2:  In  section  6,  we 
used  the  Brouwer  fixed-point  theorem  to  prove  results 
on  the  existence  and  worst-case  behavior  of  the  solution 
of  equation  2  under  various  assumptions.  However,  the 
actual  solution  of  the  system  of  equations  for  particular 
layouts  still  needs  to  be  investigated. 

4)  Modeling  poly  lines  in  MOS  technology:  We  heve 
modeled  the  wire  as  a  purely  capacitive  load.  However, 
poly  lines  have  rather  high  resistance,  and  may  be 
better  modeled  as  distributed  RC-networks.  At  present, 
the  best  that  can  be  done  for  euch  structures  is  to  use  a 
driver  of  constant  size  after  each  fixed  interval  of  the 
wire  and  this  reduces  the  delay  from  8(1*}  to  8(1 ),  where 
1  is  the  length  of  the  poly  line.  It  will  be  very  useful  to 
find  a  better  way  to  speed  up  signals  along  such  wires. 
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